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Abstract. This article establishes that the split decomposition of graphs introduced by 
Cunnigham, is definable in Monadic Second-Order Logic. This result is actually an instance 
of a more general result covering canonical graph decompositions like the modular decom- 
position and the Tutte decomposition of 2-connected graphs into 3-connected components. 
As an application, we prove that the set of graphs having the same cycle matroid as a given 
2-connected graph can be defined from this graph by Monadic Second-Order formulas. 



1. Introduction 

Hierarchical graph decompositions are useful for the construction of efficient algorithms, 
and also because they give structural descriptions of the considered graphs. Cunningham 
and Edmonds have proposed in 18 a general framework for defining decompositions of 
graphs, hypergraphs and matroids. This framework covers many types of decompositions. 
Of particular interest is the split decomposition of directed and undirected graphs defined 
by Cunningham in |17j . 

A hierarchical decomposition of a certain type is canonical if, up to technical details 
like vertex labellings, there is a unique decomposition of a given graph (or hypergraph, or 
matroid) of this type. To take well-known examples concerning graphs, the modular decom- 
position is canonical, whereas, except in particular cases, there is no useful canonical notion 
of tree- decomposition of minimal tree-width. The general results of JH1 define canonical 
decompositions. 

The present article shows that many of these canonical decompositions can be defined 
by monadic second-order (MS) formulas "inside" the considered graphs or hypergraphs (we 
do not consider decompositions of matroids in this article). More precisely, we prove that 
under the quite natural and generally satisfied hypothesis that the elementary decomposition 
steps are definable by an MS formula, the mapping from a graph or a hypergraph to the 
tree representing its canonical decomposition (of the type under consideration) is a monadic 
second-order (MS) transduction, i.e., a transformation of relational structures defined by 
MS formulas. Furthermore, in many concrete certain relational structure based 
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on this tree represents the considered decomposition, in such a way that the decomposed 
graph can be reconstructed from it. We call it a graph representation of the decomposition 
in the case where it uses relations of arity at most 2. Otherwise, we call it a hypergraph 
representation (because relational structures can be viewed as directed labelled ranked 
hypergraphs). These representations can be constructed from the graphs (equipped with 
arbitrary linear orderings of their sets of vertices or edges) by MS transductions. Roughly 
speaking, we obtain that, from the point of view of MS logic, a graph is equivalent to the 
graph or hypergraph representation of its canonical decomposition, which means that an 
MS property of the canonical decomposition of a graph G is (equivalent to) an MS property 
of G and conversely, that an MS property of G is (equivalent to) an MS property of the 
(hyper)graph representation of its canonical decomposition. 

This article contributes to the understanding of the power of MS logic for represent- 
ing graph properties and graph theoretical notions like canonical graph decompositions 
and equivalences on graphs. When a graph property is expressible in MS logic, it can be 
checked in polynomial time on graphs of bounded tree- width or clique-width. When a graph 
transformation is expressible in MS logic, it preserves the property that a set has bounded 
tree-width or clique-width. We refer the reader to ^0] and [H] for detailed expositions of 
these consequences. 

Why are canonical decompositions interesting? Canonical decompositions and their 
(hyper)graph representations are interesting for several reasons. 

First they contain useful informations on the structure of the graphs. This structural 
information has two forms: the tree, and the elementary graphs from which the considered 
graph is built. In most cases, hierarchical decompositions can be viewed as constructions of 
graphs or hypergraphs by means of particular composition operations (like graph substitution 
in the case of modular decomposition) in terms of prime graphs or hypergraphs, i.e., of those 
which are undecomposable. We will discuss this "algebraic" aspect whenever relevant. 

Second, hierarchical graph decompositions are useful for the construction of polynomial 
algorithms. For example, the first step of the polynomial algorithm recognizing circle graphs 
by Bouchet [21 consists in constructing the split decomposition of the given graph. It uses 
the fact that a graph is a circle graph if and only if each component of its split decomposition 
is a circle graph. The planarity testing algorithm by Hopcroft and Tarjan |25| begins with 
the decomposition of a graph into 3-connected components. Hence a good understanding 
of hierarchical graph structure is useful for constructing low degree polynomial algorithms. 

Third the (hyper)graph representation of the canonical decomposition of a graph re- 
quires in many cases less space to be stored than the given graph. 

Finally, canonical decompositions are also useful for establishing logical properties. For 
example, it is proved in that Seese's Conjecture holds for interval graphs, and that 
it holds in general if and only if it holds for comparability graphs. The proof makes an 
essential use of modular decompositions. (Seese's Conjecture says that if a set of graphs has 
a decidable satisfiability problem for MS logic, then it has bounded clique-width. A slight 
weakening of this conjecture is established in jlfij). 

The companion article [0] develops an application of split decomposition to circle graphs 
that we review briefly. A circle graph is the intersection graph of a set of chords of a circle. 
A graph is a circle graph if and only if all components of its split decomposition are circle 
graphs. Those components which are prime are uniquely representable as intersection graphs 
of sets of chords. It is proved in 6^ that the unique representation of a prime circle graph 
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can be defined by MS formulas (one uses a description of sets of chords by finite relational 
structures). From the split decomposition of a circle graph G and the chord representations 
of its prime components, one can define all chord representations of G. And this can be 
done by MS formulas, since the split decomposition and the chord representations of the 
prime components of G can be defined by MS formulas. Hence, from a given circle graph, 
one can define by MS formulas (using also linear orders of the sets of its vertices), all 
chord representations of this graph. (To be precise, this construction rests upon a result by 
Courcelle and Oum |16j which uses MS formula with set predicates of the form Even(X) 
expressing that a set X has even cardinality.) 

In the present article, we prove a result of the same general form: all 2-connected graphs 
equivalent to a given graph G, in the sense that they have the same cycle matroid, can be 
defined from G and the linear orderings of its vertices by a fixed MS transduction. As for 
circle graphs, the proof uses a canonical decomposition of the considered graph, constructed 
by MS formulas, from which can be defined all the equivalent graphs. This construction is 
based on Whitney's 2-isomorphism theorem, which characterizes the graphs equivalent to 
G as those derived from G by a sequences of transformations called twistings. 

Main results and overview of the article. First, we give a general set theoretical 
and logical setting in which decompositions of combinatorial structures can be defined. 
This framework covers actually two cases. In the first case, studied in Section the 
decomposition tree is rooted and directed. The fundamental example is the very well- 
known modular decomposition. The decompositions of this type correspond to definitions 
of graphs by algebraic expressions based on graph operations. In Section 12 we consider the 
second case, where the decomposition tree is unrooted and undirected. In both cases, and 
under easily applicable conditions, we prove that the decomposition tree is definable by MS 
formulas, which generalizes the MS definability result of the modular decomposition of 
In Section |I] we develop the application to the split decomposition of Cunnigham ^7] and 
we prove its MS definability, which is our second main result. We do not assume the reader 
familiar with this notion and this section presents it in detail. We prove in Section |2] the 
"logically effective" version of the above mentioned theorem by Whitney. Appendices 1 and 
2 review definitions, basic properties and technical lemmas on MS logic, MS transductions 
and clique-with. This work has been presented at the International Conference on Graph 
Theory, Hyeres, France, in September 2005. 

2. Partitive families of sets 

Trees, graphs and relational structures are finite. Two sets meet if they have a nonempty 
intersection. They overlap if they meet and are incomparable for inclusion. We write A _L B 
if and only if A and B overlap. The terms minimal, least, and maximal applied to sets refer, 
unless otherwise specified, to inclusion. 

2.1. Rooted trees and families of sets. We define the family of sets associated with a 
partition of a set V, the blocks of which form a rooted tree. This is a generalization of the 
modular decomposition of a graph where V is its set of vertices. 



4 



B. COURCELLE 



Definition 2.1. Set families and trees. A rooted tree T has its edges directed so that every 
node is accessible from the root by a unique directed path. The leaves are the nodes of 
outdegree 0. The other nodes are the internal nodes. The set of nodes is denoted by Nt 
and the set of internal nodes by iV^ n *. Although a tree is a graph, we will use the term 
"nodes" for the vertices of a tree (or a forest). This particular terminology will be useful 
for clarity in situations where we discuss simultaneously a graph and a tree representing it. 
A rooted tree is proper if no node has outdegree 1, hence if every node is either a leaf, or 
has at least two sons. We denote the son relation by sout- 

Let T be a rooted tree and V = (V(u)) ue N T be a partition of a nonempty set V such 
that V(u) is nonempty if u is a leaf (but is possibly empty otherwise). For each node u, 
we let T(u) be the rooted subtree consisting of u (its root) and the nodes reachable from u 
by a directed path. We let V(u) be the union of the sets V{z) where z is a node of T{u). 
Hence, V(u) = V if u is the root. The family T = T(T, V) of sets V(u) has the following 
properties: 

(P0) V G T, i J 7 , 

(PI) no two elements of T overlap. 

An element of a family T of subsets of a set V will be called an ^-module. Every 
family T satisfying properties P0 and PI is associated as above with a rooted tree Tj- that 
is defined as follows. We take T as set of nodes, V as root, the inverse of inclusion as 
ancestor relation. The leaves are the minimal ^"-modules. For a node N, we let V(N) = 
N - \J{M G T | M C N} and we denote this family of sets by Vj-. We have V^{N) = N. 
Hence !F(Tj-,Vj-) = T '. We have Vj-(N) / for every node iV of outdegree 1. Every pair 
(T, V) such that J-(T, V) = J- and Vj^(N) is nonempty for every node N of outdegree 1 is 
isomorphic to (Tjr, Vj?) (this means that there exists an isomorphism h of T onto Tjr such 
that Vjr(h(u)) = V(u) for every node u of T). 

If M, P G J- their least common ancestor in is the least J-"-module N containing 
MUP. We define a binary relation boxT r (x, N) holding if and only if x belongs to Vj?(N). 
We also define a binary relation modT F {x,N) holding if and only if x belongs to Vj?(N). 
This relation is membership since the nodes of Tjr are the ^"-modules. However, it will be 
useful later when we construct Tjr as an abstract tree, and not as a set of sets ordered by 
inclusion. The relations 6oxjy and mod^ are definable from each other with the help of 
the son relation of the tree Tjr. 

If the family T satisfies the stronger property: 

(P'O) V G T, i J 7 , {v} G T for each v G V, 
then the leaves of Tjr are the singletons {v}, Tjr is a proper tree and Vj-(u) is empty if u is an 
internal node. If a family T satisfies only P0 and PI, then the family T + = J-L){{v} \ v G V} 
satisfies P'O and PI. The corresponding tree Tj-+ is obtained from Tj- as follows: for each 
v such that {v} ^ J 7 , we add {v} as a new leaf with father the least ^-module containing 
v. 

Let C be a class of relational structures (see Appendix 1 for definitions). For each S G C, 
we let F(S) be a family of subsets of its domain D$- We say that T is MS-definable if there 
exists an MS formula <p(X) such that for every S in C, F(S) = {A \ A C D S ,S \= <p(A)}. 
With these definitions: 

Proposition 2.2. Let C be a set of 1Z- structures and T(S) be an MS-definable family of 
subsets of Ds which satisfies P0 and PI for every S G C. There exists a domain extending 
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MS-transduction that associates with (S,^,), where S = (Dg, (Rs)nen) £ C and Ds is 
linearly ordered by =4, the structure: 

Dec(S) = (D s U N Tl (Rs)Ren, son T , box T ) 

where T = T^^s) and boxT = boxT T(S) - 

We will give a proof of this proposition adapted from that of Section 5. In the 
structure Dec(S) the domain consists of elements of different natures. If we are given a 
structure (Du, (Ru)reK, son\j,box\j) intended to be isomorphic to {Ds U Nt, (Rs)rgTZ, 
sotit, boxT) we can identify the nodes of T as the elements x of Dy such that sonjj(x, y) or 
sonu(y,x) or boxT(y,x) holds for some y. (We assume Ds nonempty; T may have a single 
node). 

Monadic Second-order logic (MS logic in short) and Monadic Second-order transduc- 
tions (MS transductions) are defined in Appendix 1. We only recall here that an MS 
transduction (also called sometimes an MS interpretation) is a transformation of relational 
structures that is specified by MS formulas forming its definition scheme. It transforms 
a structure S into a structure T (possibly over a different set of relations) such that the 
domain Dt of T is a subset of Ds x {1, . . . , k}. The numbers l,...,k are just a conve- 
nience for the formal definition; we are actually interested by relational structures up to 
isomorphism. In many cases, this transformation involves a bijection of Ds onto a subset 
of Dt, and the definition scheme can be constructed in such a way that this bijection is 
the mapping: x i— > (x,l). Hence, in this case Dt contains Ds x {1}, an isomorphic copy 
of Ds and we will say that the MS transduction is domain extending, because it defines 
the domain of T as an extension of that of S. This does not imply that the relations of T 
extend those of S. An FO transduction is a transduction defined by a first-order definition 
scheme. 

Definition 2.3. The leaves of a tree. Let T be a proper rooted tree. We write x < y if x is 
below y and we denote by y V z the least upper bound of two elements y and z. The root is 
thus the unique maximal element of T for this order. We let A(T) = (Leaves(T) , Rt) where 
Leaves(T) denotes the set of leaves of T and Rt(x, y, z) holds if and only if x < y V z. The 
next lemma shows that if Leaves(T) is linearly ordered by some auxiliary order then T 
is definable from (\(T), =4) by a domain extending MS-transduction. The resulting tree T 
does not depend on the linear order =<I. 

Lemma 2.4. There exists a domain extending FO transduction that maps (A(T),=^) to T, 
whenever T is a proper rooted tree and =^ is a linear order on Leaves(T). 

Proof. Let T be a proper rooted tree and ^ be a linear order on its leaves. For every internal 
node x of T we let: 

• fl(x) be the ^-smallest leaf below x, called the first leaf below x, and we let fs(x) 
be the unique son y of x such that fl(x) < y; 

• rep(x) be the ^-smallest leaf below x and not below fs(x) (this is well-defined 
because in a proper tree, every internal node has at least two sons). 

We call rep(x) the leaf representing x. We have fl(x) < x, rep(x) < x, and fl(x) ~< rep(x). 

Claim 1: Let x,y be two internal nodes. If rep(x) = rep(y) then x = y. 

Proof of the claim. By contradiction. Let x, y be distinct internal nodes such that u = 
rep(x) = rep(y). Since u is below x and y, x and y are comparable. We can assume that 
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x < y. By the definitions, u is not below fs(x). Hence fl(x) -< u and fl(x) < fs(x) < x. 
Since x < y, u and fl(x) are below the same son of y, call it z; we may have x = z. We have 
fKv) — f s {y) < Di where fs{y) ^ z since u = rep{y). Hence, since u = rep(y), u is the 
^-smallest leaf among the set of leaves below y and not below fs(y), and this set contains 
fl(x). Hence u =^ fl(x), contradicting the above observation that fl(x) -< u. □ 

One can define a bijection of the nodes of T onto a subset of A(T) x {1, 2}. Each leaf it 
is mapped to (u, 1), hence the transduction we are constructing will be domain extending. 
Each internal node u is mapped to (rep(u), 2). 

Claim 2: One can write a first-order formula a(x, y, z) such that: 

(A(T), =<;) |= a(x, y, z) if and only if x ^ y and z = rep(x V y) . 

Proof of the claim. We recall that R(u, v, w) means: u < v V w for leaves u, v, w. The 
relation < denotes the ancestor relation in T and should not be confused with the linear 
order =<: on the set of leaves of T. Using R, one can construct an FO formula (3(x,y,z) 
expressing that x ^ y and z = fl(x V y). An FO formula j(x, y, u, v) can be constructed to 
express that: 

x ^ y, u ^ v, u < x V y, v < x\J y, x < mV v, and y <uV v , 

which means that for leaves u,v ^ u,x,y ^ x, u and v are below x V y but are not below 
the same son of this node. We can construct a(x, y, z) so as to express the following: 

3u[(3(x,y,u) A"z is the ^-smallest element such that j(x,y,u,z) holds"] . 

This completes the proof Claim 2. □ 

We let N = (A(T) x {1}) U (REP T x {2}), where REP T is the set of leaves of the form 
rep(x V y) for some leaves x, y ^ x. We order N by letting: 

(x, 1) < (y, 1) if and only if x = y, 

(x, 2) < (y, 1) never holds, 

{x, 1) < (y, 2) if and only if there exist leaves u, v such that 

y = rep(u V v) and Rt(x, u, v) holds, 
(x, 2) < (y, 2) if and only if there exist leaves u, v, w, z such that 

x = rep(u Vd), y = rep{w V z), Rt(u, w, z) and Rt(v, w, z) hold. 

Claim 3 : The tree (T, <) is isomorphic to (N, <) under the bijection which maps a leaf u 
of T to (u, 1) and an internal node u to (rep(u), 2). 

Proof of the claim. The four clauses above correspond to the facts that two different leaves 
are incomparable, that an internal node cannot be below a leaf, that a leaf x is below an 
internal node u V v if and only if Rt(x, u, v) holds, and that an internal node u V v is below 
w V z if and only if u and v are both below wVz. □ 

These claims give the desired result because the set REPt is FO definable in the 
structure (A(T), =<!) by Claim 2. The ancestor relation defined by the formulas before Claim 
3 is also FO definable. From it, one can obtain an FO definition of the son relation. Hence, 
we have an FO transduction as claimed. □ 
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Remark. On the role of =4. The role of the linear order =<! is to make possible the con- 
struction of a set REPt so that FO formulas can specify in a unique way the element of 
REPj- intended to represent an internal node, and to express in terms of this encoding, the 
son relation of T. The tree T is uniquely defined for every structure A(T). Uniquely means 
here that if T and T' are such that A(T) = A(T'), there is a unique isomorphism of T onto 
T' that is the identity on leaves. 

Proof of Proposition^ We will use F(S)+ instead of T{S). (We have f{S) + = F(S) if 
J-(S) satisfies P'O). It is clear that J-(S) + is MS definable. We construct a structure with 
domain a subset of D$ x {1, 2, 3}. Its domain is the union of three sets: 

- the set Dg x {1}, a copy of Ds, 

- the set Ds x {2} which is the set of leaves of T = Tj?, s <.+ (the pair (v, 2) represents 
the leaf {v} for each v in Ds), 

- and of a subset of Ds x {3}, namely REPt X {3}, (cf. the proof of Lemma l2.4j) in 
bijection with the set of internal nodes of T. 

The relation Rj-(z,x,y) is "z £ N where N is the least set in J-(S) that contains x and 
y". This is expressible by an MS formula. Hence the structure X(T) is definable from S 
by an MS-transduction. Since the set of leaves of T is linearly ordered by ^ (because Ds 
is, and is in bijection with Ds) we can obtain T from (A(T), =<!) by a domain extending 
MS-transduction. 

Then we reduce T = T^, s \+ into Tj:[s) by eliminating the leaves (y, 2) such that {v} £ 
T(S). The relation boxr T( ^ S) is also MS definable since modr T{S) {x,u) holds (cf. Definition 
12. 1[) if and only if in the tree T^, s \+ : the singleton {x} is a leaf below u. We obtain thus an 
MS transduction. A definition scheme for it can be written from the above description. □ 

Remark. One could alternatively define the domain of the constructed structure as a 
subset of Ds x {1, 2} by letting (v, 1) represent simultaneously the element v of Ds and the 
leaf {v} of Tjrts), in the case where {v} G ^(S). In this case, the internal nodes of T«5) are 
pairs (v, 2). However, in most cases, we will keep separated the domain Ds of the structure 
and the set of nodes of its decomposition tree, even if there is a "natural" bijection between 
a subset of Ds and a set of nodes of the tree. 

An MS property is order-invariant if it is expressible by an MS formula using an 
auxiliary linear order =^ of the domain of the considered structure, that can be chosen 
arbitrarily. See Appendix 1 for a more precise definition. 

Corollary 2.5. Under the hypotheses of Proposition OOJ every MS property of the struc- 
tures Dec(S) for S € C is equivalent to an order-invariant MS property of the structures 
S. 

Proof. Let V be an MS property of the structures Dec(S). By Proposition A. 1.2 (in Ap- 
pendix 1), applied to the transduction of Proposition 12. 2\ V(Dec(S)) is equivalent to an 
MS property Q of (S, =4), where ^ is any linear order of Ds- For any two linear orders ^ 
and =<! / on Ds, one obtains isomorphic structures Dec(S) by the remark before the proof 
of Proposition 12.21 Hence Q is an order-invariant MS property. □ 
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Definition 2.6. Partitive families of sets Let V be a nonempty set. A family T of subsets 
of V is weakly partitive if it satisfies the following properties: 

(PO) V G T, i T. 

(P2) For every two overlapping ^"-modules A and B we have A U B, A n 5, A — B G JF. 

It is partitive if, in addition, it satisfies the following property: 

(P3) For every two overlapping ^"-modules A and B we have AAB G JF, (where AAB = 
(A- B)U(B - A)). 

The strong ^"-modules are the .F-modules which do not overlap any other ^-module. 
The family Strong(J-) of strong jF-modules satisfies PO and PI. The corresponding rooted 
tree T strong ^ is called the decomposition tree of T or of the structure S, if T = ^(S) is 
known from the context. Its leaves are the minimal ^"-modules (they are strong). They 
are the singletons {v} for all elements v of V if T satisfies P'O. Since a singleton does not 
overlap any set, if T is weakly partitive or partitive, then T + is weakly partitive or partitive 
respectively. 

The conditions of partitivity and weak partitivity on a family T imply some particular 
structure associated with the nodes of Tstrong(T)- They are formulated in an easier way in 
terms of the tree Tstrong(T+) rather than in terms of T strong ^y We recall that the nodes 
of Tg trang rp+\ are subsets of V. 

Theorem 2.7. Let T be a partitive family. 

(1) Every internal node N of the tree Tg trong rj?+\ satisfies one of the following two proper- 
ties: 

Tl: N has k sons, Ni, . . . , iV^, k>2, and for every nonempty subset I of {1, . . . , k}, the 

set \J{Ni \ i€l} belongs to T . 
T2: iV has k sons, Ni, . . . , iV&, k > 2, and for every subset I of {1, . . . , k}, the set 

U{Ni | % £ 1} belongs to T if and only if I is {1, . . . ,k} or singleton. 

(2) If an J 7 -module is not strong, it is of the form \J{Ni \ i E 1} for some node N satisfying 
Tl and a non singleton set I C {1, . . . , k}. 

Let T be weakly partitive. 

(3) Every internal node N of the tree Tg trong ^+^ satisfies one of properties Tl, T2 or 
T3: The sons of N can be numbered N±, . . . ,N^, k > 2, in such a way that for every 

subset I of {1, ... , k}, the set |J{iVj [ i £ J} belongs to T if and only if I is an 
interval [m, n] for some m, n with 1 < m < n < k. 

(4) If an J 7 -module is not strong, it is of the form [J{Ni \ i E 1} for some node N satisfying 
Tl and a non singleton set I C {1, . . . , k}, or of the form |J{iVj [ i G [m, n]} for a node N 
satisfying T3 and m < n. □ 

See (UEIimiElUni f° r ^ e proof. The nodes of types Tl, T2, T3 are called respectively 
the complete nodes, the prime nodes and the linear nodes. In this theorem, one could 
require k > 3 in conditions Tl and T3 because the nodes with two sons satisfy T2, and 
then properties Tl, T2 and T3 would be mutually exclusive. However, in the application 
of this theorem to the modular decomposition, properties Tl, T2, T3 and the notions of 
complete, prime or linear nodes correspond to three different graph operations, and those 
corresponding to Tl and T3 may have two arguments only. 

This theorem will be used for classes C of relational structures, where for each S in C: 
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(i) we have a partitive or weakly partitive MS definable family T(S) of subsets of its 
domain D$, 

(ii) for each node N of the decomposition tree of S, one can express S[N], the substruc- 
ture of S induced by N, as a composition of the substructures S^iVi], SfiV^], . . . , S'fiVfc] 
by an operation / (such operations can be seen as generalized concatenations) where 
Ni, . . . , Nk are the sons of N, 

(iii) the nature of this operation / can be determined by an MS formula with free vari- 
ables which take Nx, N2, ■ ■ ■ , as values. 

In this case, Proposition 12.21 can be improved, and one can define an MS transduction 
that takes as input S = {Ds, (Rs)rg'r)i together with an arbitrary linear order =<! of D$ 
and produces a structure Rep(S) consisting of the decomposition tree of S augmented with 
some relations which encode the operations /, and from which S can be reconstructed by 
an MS transduction. Such a structure contains information on the hierarchical construction 
of S, and it is, in some cases, a space efficient representation of S. (See the book by Spinrad 
|29| on efficient graph representations in a very general sense). 

Hence our method consists in doing the following steps: 

(i) first, we construct from (S, =^) a structure: 

Dec(S) = (D s U N T , (Rs)r£11, son T , box T ) 

which includes S and the decomposition tree T together with the relation boxT 
which links both; this structure is independent of =3! up to isomorphism; 

(ii) second, we construct, if possible, a structure Rep(S) with domain Ds U Nt and 
relations son.T,boxT together with some relations encoding the operations /. The 
objective is here to have a space efficient representation of S, from which S can be 
reconstructed by an MS transduction. 

In some cases, the structure Rep(S) encodes a term over a signature of operations 
on graphs or, more generally, on relational structures, the value of which is S. If these 
constructions can be done with MS transductions then Corollary 12.51 applies to Rep(S) in 
place of Dec(S). At this point it is not appropriate to formalize more this notion in the 
general setting. We rather show its application in two important examples, the modular 
decomposition based on the family of nonempty modules of a graph, and the decomposition 
in blocks of certain directed acyclic graphs called inheritance graphs. Some new results are 
also established. 

2.2. The modular decomposition. Graphs are simple, directed, loop-free. Simple means 
that there is at most one edge from a vertex x to a vertex y. Graphs are finite, as already 
indicated. We denote by x — ► y the existence of an edge from x to y. The undirected 
graphs are those where each edge x — ► y has an opposite edge y — ► x. We write x — y 
if x — ► y and y — ► x. We denote by V^the set of vertices of a graph G. If X is a set of 
vertices of G, we denote by G[X] its induced subgraph consisting of X and all the edges, 
the two ends of which are in X. If E is a set of edges, we denote by G[E] its subgraph 
consisting the edges of E and their end vertices. 

Definition 2.8. Modules and graph substitution. A module of a graph G is a subset M of 
Vq such that for every vertices x, y in M and every vertex z not in M: x — ► z implies 
y — ► z and z — > x implies z — ► y. In words this means that every vertex not in M "sees" 
all vertices of M in the same way. This frequently rediscovered notion is surveyed in [2E1 
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(see also [20] for numerous references using various names for the same notion). The book 
by Spinrad [29 contains also many definitions, results, algorithms and references. 

We denote by A4(G) the family of nonempty modules of a graph G. It satisfies Property 
P'O (each singleton is a module) and is weakly partitive. It is partitive if G is undirected. 
We denote by S{G) the corresponding family of strong modules: they are the nonempty 
modules that do not overlap any module. The tree of strong modules is called the modular 
decomposition, and its leaves are the vertices of the considered graph ("are" means that 
we identify v and {i>}). The relevant operations that combine substructures are vertex- 
substitutions, that we now review. 

If G and H are graphs with disjoint sets of vertices, and u is a vertex of G, we denote 
by G[H/u] the graph such that: 

(a) its set of vertices is Vg U Vh — {u}, 

(b) its edges are those of H, those of G that are not incident with u, the edges x — ► y 
whenever x G Vq — {«}, x — > u in G, y G Vh, and the edges y — > x whenever 
x E Vq — {u}, u — > x in G, y € Vh- 

If G and H are not disjoint, we replace H by an isomorphic copy disjoint with G. When 
we write: let K be a graph of the form G[H/u] we assume, unless otherwise specified, that 
G and H are disjoint. This graph is called the result of the substitution of H for u in G. It 
is undirected if G and H are. 

If tii, . . . ,u n are vertices of G and Hi, ... , H n are graphs, we define G[H\/u\, . . . , H n /u n ] 
as G[H\/ui} . . . [H n /u n ]. The order in which substitutions are done is irrelevant, hence we 
can consider they are done simultaneously. 

A graph is prime if it has at least 3 vertices and is not of the form G[H/u], except in a 
trivial way with G or H reduced to a single vertex. The paths a — ► b — ► c and a — b — c — d 
are examples of small prime graphs. 

We will also use the graph operations 0, © and © : G © H is the disjoint union of G 
and H, G®H is G © H augmented with edges from each vertex of G to each vertex of 
H, and G © H is G®H augmented with edges from each vertex of H to each vertex of 
G. In all cases, we replace if necessary H by an isomorphic copy disjoint with G. These 
operations can be defined by K[G/u, H/v] for graphs K with two vertices u and v, and, 
respectively, no edge, an edge from u to v, edges between u and v in both directions. They 
are associative. We will consider them as operations of variable arity in the usual way. 
The operations © and © are also commutative. They transform undirected graphs into 
undirected graphs. More generally, every graph G can be turned as follows into a graph 
operation. We enumerate its vertices as v\, . . . ,v n , and we define an n-ary graph operation 
ug (where a stands for substitution) by gg{Hi, . . . , H n ) = G[H\/v\, . . . , H n /v n ]. 

Let us go back to modular decomposition. The complete nodes are of two possible 
types, © or ©, because if iV is a "complete" strong module with n sons N\, . . . , N n , then 
either G[N] = G[JVi] © • • • © G[N n ] or G[N] = G[Ni] © • • • © G[N n ]. 

If iV is " linear" with n sons Ni, . . . , N n ordered in this way (cf. T3 in Theorem I2.7|) , 
then either G[N] = G[Ni] © . . . ®G[N n ] or G[N] = G[N n ]® . . . g> G[iVi]. 

If N is "prime", with n sons N 1 ,...,N n then G[N] = (Tk(G[Ni], G[N n }) for some 
prime graph K. We have n > 3; the operations corresponding to the graphs K with 2 
vertices are ©,© and ©. 
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The terms complete, linear and prime are denned after Theorem 12. 71 If the given graph 
is a dag, i.e., a directed graph without circuits, then no node is of type (g>. If it is undirected, 
no node is of type ® . 

An MS formula tp<£,(X,Y) can express that X is a complete strong module of type 
and Y is one of its sons. An MS formula ip®{X, Y) can do the same for ®. An MS formula 
(p^(X, Y, Z) can express that A is a linear strong module, Y and Z are two sons such that 

G[X] = . . . ® G[Y] <g> G\Z\ ® An MS formula <£>p r (A, Y, Z) can express that A is a prime 

strong module, Y and Z are two sons such that G[X] = K[. . . , G[Y]/v,i, . . . , G[Z]/itj, . . . .] 
where Uj — > Uj in the prime graph K. 

By using these formulas, one can build the graph representation of the modular decom- 
position of a graph G, denoted by Gdec(G). This is a binary relational structure consisting 
of the rooted tree 15(G) enriched with the following informations: 

(a) its nodes of types ©, (g> and (g> are labelled by their respective types, 

(b) if A is a "linear" node and G[N] = G[A X ] <g) . . . ®G[A n ], we set an edge Aj — ► N i+1 
for each £ = 1, . . . , n — 1. 

(c) If A is a "prime" strong module, Aj and Nj are two sons and 

G[N] =K[..., G[Ni]/ui, G[Nj]/uj, . . .] , 
we set an edge Aj — > Nj whenever U{ — * Uj in K. 

This representation is in certain cases space efficient. Consider the graph G of a strict 
linear order on n elements (in other words, a transitive tournament). This graph has n 
vertices and n{n — l)/2 edges. The graph Gdec(G) has n + 1 vertices and 2n — 1 edges. 

The tree Ts(gt) and the structure Gdec(G) can be constructed by MS transductions using 
an arbitrary linear order of the vertices of the given graph as auxiliary information. From 
the relational structure Gdec(G), that is actually a vertex- and edge-labelled directed graph, 
one can reconstruct G by an MS transduction. We refer the reader to for illustrated 
examples and further developments, and to ^3] for the extension of these constructions to 
countable graphs. 

There are two distinct extensions of modular decomposition to hypergraphs: the decom- 
position into committees of undirected unranked hypergraphs (see [HE]) and the modular 
decomposition of k-structures which are A;-ary relational structures, hence are labelled di- 
rected hypergraphs of rank k (see [2131 )■ I* 1 both cases the families of sets are MS definable 
and Proposition 12.21 is applicable. 

2.3. Factors in directed acyclic graphs. We review some results of Courcelle jllj . 
Capelle jH], Habib et al. [23] concerning directed acyclic graphs. We show that they can be 
reformulated in the framework of this section and slightly improved. 

Definition 2.9. 2-graphs and 2-dags In this subsection, we consider directed graphs, pos- 
sibly with multiple edges (hence, not necessarily simple, as in the previous section). Those 
without circuits (and loops) are called dags (for directed acyclic graphs). We denote by Eg 
the set of edges of a graph G. A 2-graph is a graph with two distinct distinguished vertices 
denoted by si(G) and S2(G) called its sources. We denote by G° the underlying graph, i.e., 
the same graph without distinguished vertices (the sources are turned into "ordinary" ver- 
tices). We use "2-graph" as an abreviation of "graph with 2 sources" also called sometimes 
"2-terminal graph"; 2-graphs are not particular C-graphs in the sense of the definition of 
clique-width, recalled in Appendix 2. 
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A 2-dag is a 2-graph without circuits such that s\{G) is the unique vertex of indegree 0, 
S2{G) is the unique vertex of outdegree and every vertex is on a directed path from s\(G) 
to 82(G). We denote by V G the set V G — {si(G), 52(G)}, called the set of internal vertices 
of G. An orientation of a graph making it into a 2-dag is also called a bipolar orientation. 

For example, the graph of the "Wheatstone bridge" consisting of the directed path 
a — > b — > c — > d with additional edges a — > c and b — ► d is a 2-dag if its two sources 
are a and <i (in this order) and is a 2-graph and a dag but is not a 2-dag if its two sources 
are a and b. 

A factor of a 2-dag G is a 2-dag such that H° is a subgraph of G° and if an edge of G 
has one end in Vjj, then it is in H. An edge is a factor (its two ends being the sources) and 
a 2-dag is one of its own factors. We let J : £(G) denote the set of edge sets of the factors of 
G. 

The following proposition is proved in Lemma 3.5 and Corollary 3.6. Its first 
assertion is also proved in [3] and |23j . 

Proposition 2.10. For every 2-dag G, the family J : £(G) is weakly partitive and MS de- 
finable. □ 

Note that J-£(G) satisfies Property P'O. The family of strong J r <S(G)-modules is denoted 
by SJ : £(G). In order to define the tree Tg^^tQ) of a 2-dag G by an MS transduction we 
need to quantify over edge sets. Hence, we represent graphs by their incidence structures. 
For a graph G, we let Inc(G) = (Vq U Ecincc) where Vg is the set of vertices, Eq is 
the set of edges and incQ is the ternary relation such that incc(e,x,y) holds if and only 
if e : x — > y in G. By the general definitions, the binary relation modsF£( G ) defines for 
every node x of T S jr £ ^ the set of edges of the corresponding factor, that we will denote 
by G(x); hence G(x) = G[N] if N is the ,F£(G)-module represented by the node x. The 
following proposition is Theorem 3.12 of |llj . 

Proposition 2.11. There exists an MS transduction that transforms Inc(G) into Dec(G) = 
(Vg UE g U N Tst£{G) , incQ, son TsTe{G) , mod SJ r £{G) ) for every 2-dag G. □ 

Although the leaves of Tsf£(g) are ( or correspond to) the edges of G, we keep Nt s:f£(G) 
and Eg disjoint in the structure Dec(G). Proposition 12.21 could be used here because the 
family SJ r £(G) is MS definable (since graphs are represented by their incidence structures), 
but it would give a weaker result than Proposition 12 . 1 1] because we would need an auxiliary 
linear ordering of the edges of G as input of the transduction, which is not the case in 
Proposition 12.111 

The MS transduction of Proposition 12 .111 uses edge set quantifications. In the case of 
simple graphs, one can do the same without edge set quantifications. 

Corollary 2.12. There exists a monadic second-order transduction that transforms the 
structure (Vg, edg G ) into: 

(V G U N Tsrs(G) , edg G , son Ts:F£{G) , iact ST£{G) ) 

for every simple 2-dag G, where factsjr£( G )(v,x) is defined to hold if and only if v is a 
vertex of G(x). 

Proof. The proof of Theorem 3.12 in [T^ defines (V G U E G U Nt , inc G , sout , mod^) from 
(Vq U Eq^ucg) by an MS transduction that specifies the set of nodes of T = T S jr £ ^ as 
follows: Its internal nodes are pairs (v, i) where i = 2 or 3 and v is a vertex, or pairs (e, 3) 
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where e is an edge. The latter case corresponds to factors which are sets of at least two 
parallel edges. In the case of simple graphs, there are no such factors, hence these pairs 
are not needed. A leaf of T corresponds to a factor of G reduced to a single edge e and is 
defined as the pair (e,4). However, from the above remark, its father is an internal node 
defined as a pair (v, i) where i = 2 or 3 for a vertex v. Hence, this leaf can be defined as 
the pair (v, 4) in the former case and (v , 5) in the latter. It follows that T can be specified 
with a set of nodes defined as a subset of V G x {2, 3, 4, 5}. □ 

We now review the graph operations associated with the various types of nodes of 
Tsrs(G) where G is a 2-dag. We define them actually for 2-graphs. 

Definition 2.13. Operations on 2-graphs The main operation is edge- substitution. Two 
other operations will be defined as particular instances of it. For a 2-graph K with directed 
edges ei, . . . , efc, we denote by K[Gi/e\, . . . , Gfe/e^] the result H of the substitution of the 
2-graphs G\, . . . , Gk for the edges e±, . . . , respectively. For defining H, we assume what 
follows: 

(i) K, Gi, . . . , Gk have pairwise disjoint sets of edges, 

(ii) ei : si(Gj) — ► s 2 (Gi) for each i, 

(iii) K, G\ , . . . , Gk have no vertices in common other than the ends of the edges ej and 
the sources of Gi, as required by (ii). 

We let V H = V K U V Gl U ■ ■ ■ U V Gk , E H = E K U E Gl U • • • U E Gk - {ei, . . . , e k } and 
Si(H) = Si(K) for i = 1,2. If the graphs are not disjoint as required, one takes disjoint 
copies and one fuses the sources of the graphs Gi with the end vertices of the edges e% of 
K. In this case, the result of the substitution is well-defined up to isomorphism. 

The parallel composition of two 2-graphs G and H is the graph G//H defined as 
K[G/e,H/f] where K consists of two parallel edges, e,f: s±(K) — ► s-z{K). This op- 
eration is associative and commutative so that the expression G\j j . . . / /Gk is well-defined, 
and the ordering of the arguments is irrelevant. We define similarly the series composition: 
G • H = K[G/e, H/f] where K consists of two edges e : s\{K) — ► u and / : u — > S2(K) 
for some (arbitrary) u. This operation is associative, so that the expression G\ • ■ ■ ■ • G k 
is well-defined, but the order of arguments matters. In order to have a shorter notation we 
will use 0k(Gi, . . . , Gk) for K[G±/ei, . . . , G^/e^] where e±, . . . , is an enumeration of the 
set of edges of K (it is not made explicit in the notation Ok)- 

We will also use the constant e denoting the 2-dag consisting of the single edge e : x — ► 
y, with si(e) = x, 52(e) = y, and the constant e defined similarly, with si(e) = y, 82(e) = x. 

Terms built with these operations and constants denote 2-graphs. In some proofs, we 
will require that the arguments of each operation in a term are graphs with disjoint sets of 
edges (hence not graphs up to isomorphism). In this case, if t is a term denoting a graph 
with edges e±, . . . , ek, it has k occurrences of constants, which are or ej for i = 1, . . . , k. 

We now consider the case of 2-dags. 

Proposition 2.14. Let G be a 2-dag. An internal node N of its decomposition tree T s:F£ i G ^ 
is of one of the following mutually exclusive types: 

1) N is a complete node with sons N±, . . . , Nk, N = N± U • • • U Nk, we have G[N] = 
G[Ni]/ j ... j /G[Nk] and none of N\, . . . ,Nk is a complete node. 

2) N is a prime node, G[N] = 9k(G[Ni], . . . , G[Nk]) where K is a 2-dag that cannot 
be written L • M or L/ /M or K'\M\j f\, . . . , M p / f p ] except in a trivial way, with 
either K' or all M±, . . . , M p reduced to single edges. 
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3) N is a linear node with sons Ni, . . . ,N k , we have N = N% U • • • U N k , none of 
Ni,...,N k is linear and, either G[N] = G[Nx] • • G[N k ] or G[N] = G[N k ] • • 
G[iVa]. 

A leaf of this tree is an edge e : si(e) — > 52(e). 

Proof. This follows from Proposition I2.1UI and Theorem 12.71 The three types of nodes 
correspond respectively to properties Tl, T2 and T3 of Theorem 12.71 In all three cases, 
the graphs G[iVi], . . . , G[N k ] have disjoint sets of edges and we need not make isomorphic 
copies. In the second case, the vertices of K are vertices of G. Its edges are not edges of 
G: they mark positions where the subgraphs GfiVi], . . . , G[N k ] must be substituted. The 
graph K is simple because otherwise it can expressed as e/ jM or as K'[M\/ f\, . . . , M p / f p ] 
in a nontrivial way with some Mi consisting of two parallel edges. □ 

In Case 2, K has k > 3 edges because otherwise, it is of the form e • f or e//f, and 
8k is • or //. A 2-dag K satisfying the conditions of Case 2 will be called prime. In Case 
3, the sons of a node will be numbered so that G[N] = G[Ni] • • • • • G[N k ]. For building 
a term t denoting a 2-dag, we need only the operations •, // and 8 k where K is a prime 
2-dag, and the constants e. 

We now examine how such a term can be constructed in MS logic. MS formulas analo- 
gous to the formulas 92®, tp&, cp^, (pp T used in Subsection l2.2l for the modular decomposition, 
can recognize which case applies to a given module N, and can specify its sons. According 
to the general method sketched at the end of Subsection l2.11 we transform the structure con- 
structed by the transduction of Proposition l2~TTl into a graph representation of the canonical 
decomposition of the considered 2-dag, intended to be as space-efficient as possible. We let 
Rep{G) be the structure: 

(V G U Eq U N Tsreia) , inc G , son Tsre(Gy src lsre{G)l src 2S FS(G)JeafT ST£(G) ) , 

where: 

(1) srciSF£(G) = {{x,Si(G(x))) I x G N Ts:F£(G) } and 

(2) leafT STe{G) = {(x, e) | x is a leaf of T s -p £ ^ and e is the unique edge of G(x)}. 

We replace thus the relation modsF£(G) °f Dec(G) by three functional relations, and we 
avoid a certain amount of redundancy. The relation mod s:F£ ^ can be defined by an MS 
formula in the structure Rep{G). The relation leafT ST£( ^ G) is useful to establish the bijection 
between the leaves of the tree and the edges of the considered graph. For simple graphs, 
we can use the simpler structure: 

Rep'(G) = (V G U N Tst£{G) , edg G , son TsT£{G) , src lST£{G) , src 2 sre{G)) 

because there is no need to relate a leaf of the tree to the corresponding edge which no 
longer exists as an element of the domain. 

Definition 2.15. Separated representations. As explained above, to every internal node of 
the tree T S:F£ ^ corresponds an edge substitution operation 8k, and this tree can be consid- 
ered as the syntax tree of a term t that denotes the 2-dag G and is written with operations 
8k and constants e denoting the different edges. In a proof in the next subsection, we will 
transform such a term t denoting a 2-dag into another one t', intended to denote a 2-graph 
G', by replacing at certain occurrences in t, some operations 8k by operations 8k< of same 
arity. Then the evaluation of t' giving G' will be done by an MS transduction. For this 
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purpose, we introduce a variant of the structure Rep(G), called a separated representation, 
where G is a 2-dag. 
We let 
Rep se P(G) = 

(V H UE g U N Tsf£(g) , inc H ,e - edg H , son TsT£{G) , ssrc 1S rs{G)i ssrc 2Sr£(yG) ,leaf TsT£{G) ) , 
where: 

(1) Vh is the set of pairs (x, i) for x £ Nt st£(g) and i = 1, 2, 

(2) mcjj is the set of triples (e, (x, 1), (x,2)) such that e G x is the corresponding 
leaf of T sre{ G^ , 

(3) e — edg H ((x,i), (y,j)) is defined as holding if and only if Si{G{x)) = Sj(G(y)) and, 
either x and y are adjacent (one is the father of the other) or x and y are sons of 
some node z, and Sj(G(x)) / for k = 1,2. 

(4) ssrc i(S ^ (G) = {(x, | x G A^ f(G) }, 

(5) the other sets and relations are as in Rep(G). 

These sets and relations define a graph H. Its vertices are pairs (x, 1) and (x, 2) denoting 
the two sources of the factors associated with the nodes x of the tree. Each pair represents a 
vertex of G. Since a vertex of G belongs to several factors, it has several representations by- 
vertices of H. For an example, if z is a node with sons x and y such that G{z) = G{x)»G{y), 
then si(G(z)) = si(G(x)), s 2 (G(z)) = s 2 (G(y)), s 2 (G(x)) = si(G(y)). The undirected e- 
edges, defined by the symmetric relation e — edg H materialize such equalities. In this case, 
we have the following e-edges: (z, 1) — (x, 1), (z, 2) — (y, 2) and (x, 2) — (y, 1). In the case 
where G(z) = G{x)/ /G(y), we have the e-edges: (z, i) — (x,i), (z,i) — (y,i), for i = 1,2, 
which represent the equalities Si(G(z)) = Si(G(x)), Si(G(z)) = Si(G(y)), and the equalities 
Si(G(x)) = Si(G(y)) follow by transitivity. The graph H has also edges which correspond 
to those of G, however, they are not adjacent in H. They are "separated" by e-edges. Since 
t is a term denoting a 2-dag, its leaves correspond to factors with an edge directed from the 
first source to the second one. This justifies condition (2). 

This graph H, denoted by Sep(G), can be "extracted" from Rep sep (G) which contains 
also Tsjrg^Q-j as it is defined from {Vh U Eg, incn, e — edg H ). Figure 1 below shows a graph 
G, and Figure 2 shows the corresponding graph Sep(G). Edge directions are omitted for 
the purpose of readability. Dotted lines represent the pairs in e-edg H . The following fact, 
which shows how one can reconstruct G from Sep(G), is clear from the definition. 

Lemma 2.16. The graph G is obtained from Sep(G) by the contraction of all e-edges. There 
exist MS transductions transforming Rep(G) and Rep sep (G) into each other, and Sep{G) 
into Inc(G). □ 

Remark. The structures Rep(G) and Rep sep {G) use fixed finite signatures. They encode 
terms written with the operations / / and • of variable arity, and the infinitely many opera- 
tions associated with the prime graphs K. They are interesting from the point of view of the 
study of graph structure, but they are not space efficient as can be the graph representations 
of modular decompositions. 

We now give an application related to matroids. 
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2.4. Whitney's 2-isomorphism theorem. We consider directed graphs without loops 
or isolated vertices and possibly with multiple edges. The cycle matroid of a graph G is the 
pair M (G) = (Eg, indepc) where, for X C Eg, indepG(X) holds if and only if G[X] has no 
undirected cycle. The matroid M(G) does not depend on the directions of edges, however, 
the definitions are given for directed graphs because edge directions will be useful for some 
constructions. For matroids in general we refer the reader to the books by White |S2] and 
Oxley [2B]- Actually, we will need no more than this definition. 

We say that two graphs G and H are equivalent if Eg = Eh and M{G) = M(H). We 
require the equality of the sets of edges but nothing on the sets of vertices. The vertices 
and the incidencies may be different in the two graphs. In particular, any two forests with 
the same sets of edges have the same (trivial) cycle matroids, independently of how their 
edges are incident with vertices. A theorem by Whitney characterizes the equivalence of 
2-connected graphs. 

Definition 2.17. Twisting . For a 2-graph G, we let G be the 2-graph with same underlying 
graph as G except that its sources are swapped: s\(G) = S2(G) and S2(G) = si(G). We 
recall that G° is G with its sources made into ordinary vertices. For disjoint 2-graphs G 
and H, we let Gj /H be their parallel composition, obtained from the union of G and H by 
the fusion of s\(G) and s\(H), and ofs2(G) and S2(H). It is important to note here that 
E G//H = E G U En. 

A graph is 2-connected if it is connected and the deletion of any vertex yields a connected 
graph. A graph with just one edge or several parallel edges is 2-connected, and we consider 
graphs without loops. A 2-graph G is 2-connected if (e/ /G)° is 2-connected. Edge directions 
do not matter in these definitions. A 2-dag is a 2-connected 2-graph. 

If G = (L//M) and H = (L//M) where L and M are connected 2-graphs, we say 
that H is obtained from G by a twisting. Note that Eh = Eg- Reversing an edge direction 
is a twisting. 

The equivalence of two graphs G and H without isolated vertices is characterized by 
Whitney's 2-isomorphism Theorem as the existence of a transformation of G into H by a 
finite sequence of twistings and of transformations of two other types called vertex splitting 
and vertex identification. See the chapter by J. Oxley in the book edited by N. White |32] . 
or [HOI- However, these latter transformations do not apply to 2-connected graphs. Hence, 
this theorem yields the following: 

Proposition 2.18. Two 2-connected graphs are equivalent if and only if one can be trans- 
formed into the other by a finite sequence of twistings. □ 

Figure 1 shows two graphs which are 2-isomorphic. Our aim is to prove the following 
theorem: 

Theorem 2.19. There exists an MS transduction that associates with (Vg U EQ,incG, =4) 
where G is a 2-connected graph and =4 ranges over all linear orders on Vg, the set of graphs 
having the same cycle matroid as G. 

Lemma 2.20. A 2-connected 2-graph G is either: 

(0) e ore or 

(1) G\l I . . . j /Gk for k > 2, and some 2-connected 2-graphs G\, . . . , G^ not of this form, 
or 
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Figure 1: Two 2-isomorphic graphs G (to the left) and H. 

(ii) G\ • • • • • G k for k > 2 and some 2-connected 2-graphs G\, . . . , Gk not of this form, 
or 

(iii) 6k(G\, . . . , Gk) where K is a prime 2-dag and G±, . . . , Gk are 2-connected 2-graphs. 

Prime 2-dags are defined after Proposition 12.141 These expressions are unique except 
for the directions of the edges of K and the ordering of G\ , . . . , Gk in (i) , but we will not 
need this fact. 

Proof. For a 2-dag G, this result is Proposition 12 .141 Otherwise G can be made into a 2-dag 
H by reversing some edge directions. The result holds for H, whence also for G by reversing 
again the same edges. This corresponds to changing certain constants e into e. fj 

It follows that every 2-connected 2-graph G can be expressed as the value of a term t 
belonging to the set T of finite terms defined recursively as follows: 
either t = e, 
or t = e, 

or t = //(ti, . . .,t k ), 

or t = m(ti, . . .,t k ), 

or t = 9 K (h, ■ ■ -,tk), 
where t\, . . . , are in T. 

We recall that we denote by 6k the substitution operation associated with K and the 
list e 1 ,...,e k of its edges: K (Gi,. . .,G k ) = K[Gx/e\, . . .,G k /e k ]. (The list ei,...,e k is 
implicit in the notation 9k)- The operations / / and • have a variable arity. A term in T 
obtained by using recursively Lemma 12 . 2UI will be called a canonical term for G. It is unique 
up to the ordering of the arguments of the operations // and up to the directions of edges 
in K. If the set of edges of G is {/i, . . . , f n }, then the constants occurring in a canonical 
term for G are fi or fi, . . . , f n or f„. 

Definition 2.21. Twistings of 2-graphs. Twisting for graphs is defined above. In order 
to characterize the twistings of 2-connected graphs in terms of their decompositions in 2- 
graphs, we extend the notion of twisting to 2-graphs. A twisting of a 2-graph G is a 2-graph 
H such that either H = G, or H is a twisting of G°, Si(H) = Si(G) for i = 1,2, and 
G° = (Lj /M)°, H° = (L/ /M)° in such a way that the two sources of G are two vertices of 
L, of M, or of both (we may have G = L//M). These conditions imply that for every 2- 
graph K, Kj /H is a twisting of K//G. They also imply that if G is a 2-connected 2-graph, 
then so are H and, L and M when the second case of the definition is used. 
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For every 2-graph G, we denote by V(G) the least set of 2-graphs containing G and 
closed under twisting, hence of 2-graphs obtained from G by a finite sequence of twistings. 
For every graph H in V(G), Eh = Eq, and either Si(H) = Si(G) for i = 1, 2, or Si(H) = 
a 3 -i(G) fori = 1,2. 

Lemma 2.22. Let G be a 2-connected 2-graph and H be a twisting of G. 

(i) If G = Gi/ / . . . / /Gk for k > 2, and some 2-connected 2-graphs G±, . . . , G k not of 
this form, then H = G\j j . . . / /Gi-i/ /Hi/ /Gi+i ■ ■ ■ / /Gk where Hi is a twisting of 
Gi, or H = L\/ 1 . . . / /L k where each Li is either Gi or Gi, 

(ii) if G = G\ • ■ • • • Gk for k > 2 and some 2-connected 2-graphs G\, . . . , Gk not of this 
form, then H = G\ • • • ■ • Gi— \ • Hi* Gj+i • ■ • • • Gk where Hi is a twisting of Gi, or 
H = G\ • • ■ • • Gi-i • Gj • Gj-x • • • • • GVfi • Gi • Gj+% • ■ ■ ■ • Gk for 1 < i < j < k, 

(iii) G = K[G\/e\, . . . , Gk/e-k] where K is a prime 2-dag and G%, . . . , Gk are 2-connected 
2-graphs, then H = K\G\/e\, . . . , Hi/ei, . . . , Gk/ek] where Hi is a twisting of Gi, or 
H = K[G 1 /e 1 ,...,G k /e k ]=G. 

Conversely, in all cases, every graph H of the above forms is either G or a twisting of G. 

Note the special cases of 

(i) H = G = Gi// ...//G k , and^ _ 

(ii) H = G = G k *G k - 1 *---*G 2 »G 1 . 

Proof. Let G be defined from G\, . . . , G k by one of cases (i)-(iii) and H be a twisting of G.If 
H = G, then the conclusions hold in all three cases. Let us now assume that G° = (L//M) 
and H° = (L//M) and, without loss of generality, that the two sources of G are vertices 
of L. 

We will prove that we have one of the following three cases: 

(a) M° is a subgraph of some Gi in any of cases (i)-(iii), then the replacement of M by 
M yields a 2-graph Hi, and by replacing Gj by Hi, we obtain H from G, as required. 

(b) G satisfies case (i) and the two sources of M are those of G: then M is the parallel 
composition of some of the Gj's, and we obtain H from G by replacing each of these 
Gj's by Gi, this is the second possibility of case (i). 

(c) G satisfies case (ii) and one source of M is a source of some factor Gj. Then the 
other one is also a source of some factor Gj (with i ^ j, otherwise case (a) applies). 
Then M = Gy • • • • • Gj> for some 1 < i' < j 1 < k, and H is defined by the second 
possibility of case (iii). 

To complete the proof, we need only verify that there are no other cases. 

As in the proof of Lemma I2.2U1 we make G into a 2-dag G' by reversing if necessary 
some edge directions. We denote by G\, L' and M' the 2-graphs obtained from Gj, L 
and M by these reversals. An internal vertex of M' is on a directed path from si(G') 
to S2(G'). This path goes through the two sources of M' . By changing if necessary the 
source numbers of M' we may assume that this path traverses M' from s\{M') to S2(M'). 
All paths associated in this way with the internal vertices of M 1 do the same. They must 
traverse M' from s\{M') to sz(M') otherwise M' whence G' has a circuit. Hence M' is a 
2-dag and a factor of G' . Clearly, the Gj's are also factors of G'. Consider its decomposition 
tree Tsfs(G'): the Gj's are the sons of its root. We apply Theorem I2.7f 4) to M': if it is a 
strong module (with respect to SJ-£(G')), it corresponds to a node of this tree, and thus 
is a factor of (possibly equal to) some Gj. If it is not strong it is a union of sons of a 
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strong module N, satisfying Tl or T3. If N is the root, we are in the above cases (b) or 
(c) for M', G' 1} . . . , G' k instead of M, G%, . . . , G k - Otherwise, M' is a factor of some G\. By 
resestablishing the original edge directions, we see that M satisfies one of (a), (b), (c) as 
required. 

This completes the proof of the "only if" direction. The other one is easy to verify. □ 

A k-permutation is a permutation of {1, . . . , k}. For every /c-permutation w, we denote 
by •tt the operation of arity k such that: 

• % (Gi, . . . , G k ) = •(G 7r ( 1 ), . . . , G n (ty). 

For every canonical term t, we denote by V(t) the set of terms defined inductively as follows: 

V(e) = V(e) = {e,e} 

v(//(ti,...,t fc )) = {//(si,...,s fc ) | Sl e v(ti)} 

V . . . , tk)) = {• 7r (si, . . . , Sfc) | Si G V(tj), 7r is a /c-permutation } 

S7(6 K (ti,...,t k )) = {9 K ( Sl ,...,s k ) | Si e V(ti)}U {^(si,...,^) | Si € V(*i)}. 

Lemma 2.23. For every 2-connected 2-graph G with canonical term t, the set of 2-graphs 
V(G) is the set of values of the terms in V(t). 

Proof. For every 2-connected 2-graph G we have in the four cases of Lemma I2.2UI 

(0) V(e) = v(e) = {e,e}, 

(1) v(Gi// . . . //G k ) = v(Gi)// . . . //v(G fc ), 

(ii) V(Gi • • • • • Gfc) = |J{v(G 7r ( 1 )) • • • • • V(G 7r (,t)) | 7T is a /c-permutation }, 

(iii) V(K[Gi/ei, G fc /efc]) = K[v(Gi)/ei, V(G fc )/e fe ] U ^ [v(Gi)/ei, V(G fc )/e fc ], 
where in all cases the operations on 2-graphs extend to sets of 2-graphs in the natural way. 

The result for cases (o),(i),(iii) follows from Lemmas I2.2UI and 12.221 For case (ii), the 
inclusion C follows from Lemma 12.221 (ii) , and the inclusion D follows also from the facts 
that every permutation is a composition of transpositions and that 

Gi • • • • • Gi-\ • Gj+i • Gi • ■ ■ ■ • Gk 

is a twisting of 

Gi • • • • • Gj_i • Gi • Gj + i • Gj + i • • • • Gk ■ □ 

The idea of the proof of Theorem 12.191 is illustrated by Figures 1 to 3. By reversing 
some edge directions if necessary, we make the given graph into a 2-dag G with its two 
sources the ends of some edge. From the decomposition tree Tgjr^^ we construct the 
canonical term t of G and the graph Sep(G) from which G is obtained by contraction of 
the e-edges. The e-edges of Sep(G) represent the graph operations with which t is built. In 
order to produce a term t! in v(t) yielding G' equivalent to G, it suffices to modify some 
operations in t according to Lemma f2.23l These modifications are reflected by modifications 
of the e-edges of Sep(G) giving a graph M(Sep(G)) (where M means "modification") from 
which Q is obtained by contracting the e-edges. All these manipulations can be done by 
MS transductions. 

Example 2.24. These definitions are illustrated in Figures 1-3. Figure 1 shows a graph 
G and a graph H that is 2-isomorphic to G. We make G into a 2-dag, the two sources of 
which are the ends of edge a. The corresponding canonical term is: 

t = //(a, .[//(c, .(b, f)),9 K (g, k, m, h, //(n, p)), //(d, e)]) 
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Figure 2: The separated graph Sep(G). 



where K is the graph (defined as K4 minus one edge). Figure 2 shows the corresponding 
graph Sep(G). The e-edges are represented by dotted lines. The graph M(Sep(G)) on 
Figure 3 is obtained by modifying certain e-edges, and the modified edges are represented 
by broken lines. These modifications correspond to replacing in t the subterm •(b,f) by 
• (f , b), 9k by 9^ and the operation • occurring first in the subterm »[ti, t%] by •„■ where 
7r(l) = 3, 7r(2) = 1, 7r(3) = 2. For the purpose of readability, the edges of G are undirected. 

We now detail the proof more formally. 

Proof of Theorem \2.1fA We choose in the given graph G two adjacent vertices, we make 
them into sources s\{G) and S2(G), and we change some edge directions to make G into a 
2-dag. This is possible by Lemma 3.1 in 11 since G is 2-connected. Furthermore this can 
be done by an MS transduction taking Inc(G) as input (by the reorientation technique of 
jH]). Hence, we obtain a 2-dag from G by a finite sequence of twistings if G is not a 2-dag, 
because reversing an edge is a twisting. Without loss of generality we now consider that the 
given graph G is a 2-dag. Using Lemma T2.161 we can construct the structure Rep sep (G), 
and from it the structure Sep{G). 

The tree T = T S jt£(g) i n the structure Rep sep (G) is the syntactic tree of the canonical 
term t for G. 

The terms t' in v(i) are obtained by selecting: 

- a set X of prime nodes of T corresponding to an operation 9k to be replaced by 
K , 

- for each linear node x of arity k a A:-permutation 7r such that the operation • at x 
is to be replaced by •„., 

- a set Y of leaves corresponding to reversals of edge directions. 

The sets X, Y are straightforward to specify as parameters X, Y C Nt of the MS transduc- 
tion we are constructing. 

The permutations associated with the linear nodes are obtained from a linear order =<! 
on Vq as follows. We call S2(G(x)) the leading vertex of the factor G(x) of G, for x in Nt- 
(See after Proposition 12.101 for the notation G(x)). A vertex may be leading for several 
factors. Let x be a linear node with sequence of sons yi, . . . , y^. The linear order ^ on 
Vg will be used here to permute this sequence. The leading vertices of G(y±), . . . , G(yk) 
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Figure 3: The graph M(Sep(G)). 

are pairwise distinct. There exists a unique permutation ir such that S2(G(y 7r (i))) -< ... 
-< S2(G ! (y 7r (&))). We obtain in this way a (possibly identity) permutation of the list of sons 
of x. It is clear that the new ordering of the sons of x is MS definable from =<! and the other 
relations of the structure Rep sep (G). The variable II will denote families of permutations 
of appropriate types associated with linear nodes (a /c-permutation for a node with k sons) 
and n(^) will denote the one induced as defined above by a linear order =<! on Vq- We 
denote by tx,y,u the term obtained from t by the modifications described above, based on 
X, Y and II. The set V(t) is thus the set of all terms tx,y,u- 

Claim 1: There exists an MS transduction that associates the graph defined by the term 
tx,Y,Ti{=4) with the structure (Rep sep (G), X, Y, =<Q, where X is a set of prime nodes of the 
decomposition tree T, Y a set of leaves and ^ is a linear order on Vq. 

Proof of the claim. By using X, Y and we transform Rep sep (G) into a graph H' from 
which the graph G' defined by the term tx t Y,n(4) can be obtained by the MS transduction 
that contracts the e-edges. The construction consists in modifying the e-edges in Sep(G), 
so as to represent the replacements of 9k by 9^ at the nodes in X, those of the operation 
• by •tt at every linear node where the corresponding permutation ir is specified by II(=^), 
and the reversal of edges at the leaves of Y. 

For this purpose we modify in Rep sep {G) the relation e-edg H into e-edg H i as follows: 

1) For every x in X and every son y of x, we replace the pairs ((x, i), (y, j)) and 
{(y,j), OM)) in £-edg H by ((x,3 - i), (y,j)) and ((y, j), (x,3 - i)). 

2) For every linear node with ordered list of sons y\, y^, ■ ■ ■ , yk we have in e-edg H the 
following pairs, together with their inverses: 

((*, 1), (yi, 1)), ((yi, 2), (y 2 , 1)), . . . , 2), (y k , 1)), ((y k , 2), (x, 2)) . 

We replace them by the following ones, together with their inverses: 

({%, 1), (y ff (i), l)), 2), (i/ w ( 2 ), l)), • • • 2), (y ff ( fc ), l)), ((y ff (fc),2), (x, 2)) 

where 7r is the /c-per mutation of the family n(^) associated with x, 

3) For every y in Y, we replace a triple (e, (y, 1), (y, 2)) in incfl- by (e, (y, 2), (y, 1)). 
This modification of Rep sep (G) can be done by an MS transduction using X,Y and ^. We 
obtain in this way the graph M(Sep(G)) from which can be defined by edge contractions 
the value G' of the term tx t n(=4)- ^ follows from Lemma 12 . 1 61 that G' can be obtained from 
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M(Sep(G)), whence also from (Rep sep (G), X, Y, by an MS transduction (we use here 
Proposition A. 1.2). This proves Claim 1. □ 

Since Rep sep {G) can be constructed from Inc(G) by an MS transduction, and by com- 
posing this transduction with the one just constructed, we get an MS transduction r that 
defines some graph 2-isomorphic to G from Inc(G),X, Y and any linear order =<! on Vq- To 
complete the proof, it remains to establish that every family II of permutations (of appro- 
priate types) is n(^) for some linear order =<! on Vq- This will give us that every graph 
2-isomorphic to G is obtained by r from Inc(G) for some sets X and Y and some linear 
order ^ on V^. 

Claim 2: Every family II of permutations associated with the linear nodes is n(^) for 
some linear order ^ on Fg. 

Proof of the claim. Let a family II be given. By using bottom up induction on T, we 
construct, for every node x of T, an appropriate linear order on the vertices of Vg(x) ~ 
{si(G(x))} handled as the increasing sequence Seq{x) of its elements. 

Let us note that if x and y are incomparable nodes in the tree T {incomparable means 
that no one is an ancestor of the other) then the only vertices that can be common to G{x) 
and G(y) are among the source vertices. 

We can construct the sequences Seq(x) by bottom up induction as follows: 
If x is a leaf (it corresponds to an edge of G) then Seq{x) is the sequence with single 
element S2(G(x)). 

If x = II (xi, . . . , Xk) we have sequences Seq(xi) by induction. Since two distinct factors 
G(xi) and G(xj) have only in common the sources s a (G(x)) = s a (G(xi)) = s a (G(xj)) for 
a = 1,2, the only vertex common to two sequences Seq(xi) and Seq(xj) is S2(G(xi)) = 
S2(G(xj)) = S2(G(x)), and all sequences Seq(xi) can be merged into a single one, that we 
can take as Seq{x). 

If x = »(xi, . . . , Xk) and ir is the permutation of {1, ... , k} associated with x (by the 
family II), then, since the sequences Seq(xi) obtained by induction are pairwise disjoint, 
we can take Seq{x) = Seq(x w ^) . . . Seq(x 7r ^))- By this construction, the permutation 
associated with x by any linear order for which Seq(x) is increasing is actually it. 

If x = 9k(xi, ■ ■ ■ ,Xk) we have sequences Seq{xi) by induction. Two sequences Seq(xi) 
and Seq(xj) either are disjoint or share the only vertex S2(G(xi)) = S2{G{xj)) = v, in 
the case where the two edg of K have the same target v. Hence we can merge all 

sequences Seq(xi) such that x\ has target v into a sequence L(v). Then we concatenate the 
sequences L(v) for all vertices of v of K (except for v = s\(K)), which gives Seq(x). 

Every sequence Seq(x) is a subsequence of Seq{y) if y is an ancestor of x by this con- 
struction. From the choice made for linear nodes x it follows that the sequence Seq(rootT) 
of the root of T yields the appropriate permutation of II at each x. To have a linear order 
on Vq, we add to Seqfrootr) the vertex s\(G) as very first element, proving Claim 2. □ 

This concludes the proof of the theorem. □ 

By a simple counting argument, one can see that it is impossible to specify all permuta- 
tions of arbitrarily large sets X with fc-tuples of subsets of X for fixed k. For this reason, we 
specify the permutations associated with the linear nodes by linear orders on the vertices. 
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3. Partitive families of bipartitions 

The general framework for split decomposition is defined by Cunnigham and Edmonds 
in ^H]. It applies to other cases, in particular to hypergraphs and matroids. Our presenta- 
tion owes a lot to the dissertation of Montgolfier |19j . 

3.1. Definitions and general properties. We define families of bipartitions of a set V 
associated with a partition of this set, the blocks of which are organized into an unrooted 
tree. These definitions generalize two important examples: the decomposition defined by 
Tutte of a 2-connected graph in 3-connected components and the split decomposition defined 
by Cunnigham |17j . 

Definition 3.1. Bipartitions, overlapping bipartitions. A bipartition of a nonempty set V is 
an unordered pair of subsets, P = {A, B} such that V = AUB, Af)B = 0, {A, B} ^ {0, V}. 
The sets A, B will be called the blocks of P. We denote by B(V) the set of bipartitions of 
V. Two bipartitions P and Q overlap if A _L B for all A £ P and B £ Q. Hence P and Q 
do not overlap if and only if A n B = for some A £ P and B £ Q. A bipartition of the 
form {{v}, V — {v}} does not overlap any bipartition. 

By an unrooted tree we mean a simple undirected connected graph without cycles (and 
without loops). It has no root, and the leaves are the nodes of degree 1. Its other nodes are 
the internal nodes. The sets of nodes and of internal nodes of a tree T are denoted by Nt 
and NjT respectively. For each edge e : x — y of T, we denote by T(x, y) the set of nodes, 
including x, that are reachable from x by a path that does not use edge e. 

Let T be an unrooted tree with at least two nodes and V = (V(x)) x( =n t be a partition 
of a set V such that V{x) is not empty if x is a node of degree 1 or 2. We call (T, V) a 
tree-partition of V. For each edge e : x — y of T, we let P e = {P x , Py} where P x is the union 
of the sets V(z) for z £ T(x,y), and similarly for P y with T(y,x). The family B = B(T,V) 
of bipartitions P e is not empty and satisfies the following property: 

Bl: no two bipartitions of B overlap. 
If V(x) is empty for every internal node x and is singleton for each leaf x, then B = B(T, V) 
satisfies in addition the property: 

BO: {{v}, V - {v}} £ B for every v £ V. 
For a tree-partition (T, V) we define boxx{v, x) to hold if and only if x £ Nt and v £ V(x). 
For every nonempty family B of bipartitions, we let B + = B U {{{^}, V — {v}} \ v £ V}. Since 
{{v}, V — {v}} does not overlap any bipartition, B + satisfies Bl if and only if B satisfies 
Bl. A block of a bipartition of B is called a B-block. 

Lemma 3.2. For every nonempty family B C B(V) satisfying Bl, there exists a tree- 
partition (T, V) such that B(T, V) = B. It is unique up to isomorphism. 

Proof. We first make some observations concerning B(T, V) where (T, V) is a tree-partition, 
by using the notation of the definition. 

Claim: A 23-block is minimal if and only if it is V(x) for some leaf x of T. 

Proof of the claim. It is clear that V(x) is a minimal i3-block if x is a leaf. For the other 
direction, assume that A is a minimal S-block and A = P x where P e = {P x , Py}, e : x — y. If 
x is not a leaf there is an edge x — z, z ^ y, and P z is a proper subset of P x by the condition 
that V{u) is not empty if u is a node of degree 1 or 2. Hence, this cannot happen, x is a 
leaf and A = P x = V{x). □ 
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We first prove the unicity property. Assume B(T, V) = B(T', V') for two tree-partitions. 

Let R be a minimal block. We have R = V(r) for some leaf r of T by the claim. We 
make T into a rooted (directed) tree with root r and we let s be adjacent to r in T. Let 
V\{u) = V(u) for every node u ^ r. By Definition 12.11 T(s) is the subtree of T with root 
s; its nodes are those of T except r. Clearly, J r (T(s), Vi) is the set of £>-blocks that do not 
include R. 

Similarly for B(T', V') we have R = V'(r') for some leaf r' of T', we let s' be adjacent 
to r' in T', and we have F(T'(s'),V[) = J 7 (T(s),Vi). For a node u of T(s) of outdegree 
1, hence of degree 2 in T, the set V\{u) is not empty, and the same holds for (T'(s'),V[). 
Hence, by an observation made in Definition l2.il (T(s), Vi) and (T'(s'), V[) are isomorphic. 
So are (T, V) and (T", V'), as was to be proved. 

We now prove the existence (T, V) such that B(T, V) = B where B C 0(F) satisfies Bl. 
We let be a minimal block and T be the set of £>-blocks that do not include R. Hence 
J 7 is a family of subsets of V — R that satisfies PO. It satisfies Condition PI because if A 
and A' in T overlap, then {A, B} and {A', B'} overlap since R <Z B n B' and this cannot 
happen since B satisfies Bl. 

Let (3>,Vf) be as in Definition ED Hence T(T T ,V T ) = T. We recall that V^(N) = 
N — |J{M | M is a son of N in TV}. We add a new node r linked to the root s of TV, we 
denote by T the undirected tree obtained in this way, and we let V(r) = R, V(N) = Vf(N) 
if is a node of Tjr. Then (T, V) is a tree-partition of V. In particular if N has degree 2, 
then V(N) is not empty because N has outdegree 1 in Tjr and Vjr(A) is not empty. We 
claim that B(T, V) = B. 

Let P e be the bipartition of V associated with an edge e of T. If e : r — s, then 
-P e = {it!, V — R} which belongs to B. Otherwise let e be directed x — ► y in Tjr. Then 
P e = {P x ,P y } . The set P y is the ^"-module associated with the node y of Tp. Hence 
{P y , V -P y } £ B. But V — Py = P x . Hence P e G B. 

Conversely, let P = {A, B} G B. If P = {R, V - R} it is in 0(T, V), corresponding to 
the edge r — s. Otherwise it does not overlap {R, V — R} and since R is minimal, we have 
R C A or R C -B. Assume the first. Then B £ hence is a node y of Tj?, its father is some 
x and we have {B, V — B} G 0(T, V). But A = V — B, hence P G B(T, V). This completes 
the proof. □ 

We denote by (Tg, Vg) the tree-partition associated with by Lemma HT2l It does not 
depend on the choice of r by the unicity property. It will be useful to extend this definition 
to the case where B is empty: we let then Tq consist of a single node r and Vg(r) = V. 

Lemma 3.3. Let B C B(V) satisfy Bl. For a node x G Nt b of degree k with incident edges 
e\ : x — y 1 , . . . , : x — yu, the sets P y , . . . , P Vk ( where P ei = {P x , P Vi } ) are pairwise disjoint 
and we have: 

V B ( X )=V-(Py i U---UPy k )=f]{P t x \i=l,...,k} . 

If x is a leaf, then k = 1 and Vjg(x) = P x . 

Proof. This is clear from the construction of Lemma 13.21 and the definition of in Defi- 
nition O □ 

Let C be a class of relational structures as in Section[21 For each S in C, we let B(S) be a 
family of bipartitions of the domain D$ of S satisfying condition Bl. We say that B is MS- 
definable if there exists an MS formula ip(X) such that for every S in C, {A \ {A, B} G B(S) 
for some B} = {A C Dg \ S \= (p(A)}. With these definitions: 
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Proposition 3.4. Let C be a class of 1Z- structures and B be an MS definable family of 
bipartitions of the domains of the structures inC which satisfies conditions Bl. There exists 
a domain extending MS-transduction that associates with (S, =4) where S = (Ds, (Rs)r£Tz) 
£ C and Ds is linearly ordered by =4, the structure 

Dec(S) = (D s U N Tb(S) ,(R s )r € ti, ed 9T B(S y box T B(S) ) 

such that (N TB(S) ,edg TB{s) ) = T B{S) . 

Proof. Lemma f3.2l reduces the construction of the tree T B ( S ) to that of a tree associated 
with a family T of subsets of Ds- Since the structure S is linearly ordered, one can take 
for R the unique minimal 23(<S)-block that contains the ^-smallest element of Ds- The 
corresponding family T is thus MS definable. Using Proposition 12.21 an MS transduction 
can construct the corresponding rooted tree Tjf, modified so as to yield the tree T (cf. the 
proof of Lemma l3~2|) . One gets the desired unrooted tree T B ^ S ) = (-^T B(S) , ed 9T B{S) )- The 
definition of the relation 6oxt b(S) is easy to write in MS logic. □ 

The constructed structure is, up to isomorphism, independent on ^, by the unicity 
result of Lemma 13.21 This proposition has a corollary fully similar to Corollary 12.51 of 
Proposition 12.21 

Definition 3.5. Partitive families of bipartitions. Let V be a nonempty set. A family B 
of bipartitions of V is weakly partitive if it satisfies the following property: 

B2: For every two overlapping elements P and Q of B, we have {A nB,4'U B'} £ B, 
whenever {A, A'} = P and {B, B'} = Q. 

It is partitive if, in addition, it satisfies the following property: 

B3: For every two overlapping elements P and Q of B, we have {AAB, AAB'} € B, 
whenever {A, A'} = P and {B, B'} = Q. 

Note that in B3, we have AAB' = A'AB. 

The bipartitions of B will be called the B-splits of V (or of the structure S, if B = B(S)). 
Those which do not overlap any other bipartition of B are called the good ,8-splits. (We 
keep our terminology close to that of ^Sj and ^7] which are the fundamental articles for 
these notions). If B is weakly partitive, the family Good(B) of good £>-splits is nonempty: 
let A be a minimal ,8-block among those containing an element v; if {^4, V — A} overlaps 
{B,C} where B contains v, then, by B2, {A(~)B, (V — A)\JC} G B, and A is not a minimal 
block containing v ; hence {^4, V — A} is a good B— split. Clearly, Good(B) satisfies Bl. The 
corresponding unrooted tree is TQ ood ^y If we transform a family B into B + so as to insure 
Property BO, then B + is weakly partitive or partitive if B is weakly partitive or partitive 
respectively. 

If {A, B} is a split of a structure S, we consider S as a composition of the smaller induced 
substructures S [A] and S[B]. By iterating the splitting, one reaches a decomposition of S 
into unsplittable pieces. The objective is to obtain in this way a canonical decomposition. 

As in Theorem 12 .7\ the conditions of partitivity and weak partitivity on a family B 
imply some particular structure associated with the nodes of T Good ^-j, and we also express 
this structural property for the tree T Good ^ B +y We recall that the leaves of this tree are 
the singletons {v} for v in V. If N and M are adjacent nodes of T Good ^+y we also recall 
that T Good (g+}(N, M) denote the set of nodes of T Good ^+^ (including N) that are reachable 
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from N by a path not containing M. For B denned by the context, we denote by V(N, M) 
the set of elements of V at the leaves belonging to T Good ^+^(N, M). 

Theorem 3.6. ([HI, QU) Let B C S(V) &e partitive. 

(1) Every internal node N of the tree Tg+ satisfies one and only one of the following two 
properties: 

SI: N has k neighbours, N\, . . . , iV^, k > 3, and /or every nonempty proper subset I of 
{1, . . . , k}, the pair 

B(N, I) := { (J{V(Ni, N) \ i G I},\J{V(N t , N) \ i G {1, . . . ,k} - 1} } 

belongs to B. 

S2: (1) iV /ias A; neighbours, N\, . . . , A^, > 3, and for every subset I of {1, . . . , k}, the 
pair B(N,I) (as defined above) belongs to B if and only if I or {1, ... ,k} — I is 
singleton. 

(2) If a B-split is not good, it is of the form B(N, I) for some node N satisfying Tl and a 
non singleton set I C {1, . . . , k}. 

Let B C B(V) be weakly partitive. 

(3) Every internal node N of the tree T Good ^+^ satisfies one and only one of properties SI, 
S2 or the following 

S3: N has at least 3 neighbours that can be numbered as N\, . . . , in such a way that 
for every subset I of {1, . . . , k), the pair B(N,I) belongs to B if and only if I is 
an interval [m,n] or its complement for some m,n with 1 < m < n < k (and 
{1, . . . , k} ^ [m,n]). 

(4) If a B-split is not good, it is of the form B(N, I) for some node N satisfying SI or S3 
(with m < n). □ 

The nodes satisfying SI, S2, S3 are said to be, respectively, complete, prime, and 
circular. Montgolfier's dissertation ,I]2| reviews several applications from ^H], together 
with other ones that we do not discuss here. 

3.2. The Tutte decomposition of 2-connected graphs. We review briefly the Tutte 
decomposition of 2-connected graphs used in ^H] to introduce the theory of graph decom- 
position. This notion does not depend on edge directions, hence graphs will be undirected 
in this section. They are loop- free, without isolated vertices, they may have multiple edges. 
The notation and definitions of Subsection 12.31 for 2-graphs are used here with obvious 
adaptations to undirected graphs. 

Definition 3.7. 2- separations. A 2-separation of a graph G is a bipartition {A,B} of 
its set of edges Eq such that A and B have at least two elements and there are exactly 
two vertices, u and v, which are incident with edges from both blocks of the bipartition. 
(For example {{a, b, c, /}, {d, e, g, h, k, m, n, p}} is a 2-separation of the graph G of Figure 
1.) Hence, G = (G UV [A]//G UV [B])°, where Sl (G uv [A]) = Sl (G uv [B}) = u, s 2 (G uv [A]) = 
S2(G UV [B\) = v, G UV [A]° = G[A] and similarly for B. For the purpose of iterating the 
decomposition process, it is convenient to consider that the two graphs resulting from this 
decomposition step are and G + [B], obtained from G[A] and G[B] by the addition of 

a new undirected edge u — v, labelled in a particular way, and called a marker. The graphs 
Cr + L4] and have in common the marker edge, its two ends and nothing else. They 

have no distinguished vertices. The decomposition process is applied to them recursively. 
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If G is 2-connected, then G[A] and G[B] are connected, and furthermore G + L4] and 
G + [-B] are 2-connected (|18|. Lemma 1). A graph without any 2-separation is 3-connected. 
We denote by 25(G) the set of 2-separations of a graph G. 

A decomposition of a graph H is a set of graphs (called the components of the de- 
composition) which is either {H} or the set obtained from a decomposition by replacing 
one of its components, say G, by G + [A] and G + [B] defined from a 2-separation {A, B} of 
G. This process is applied recursively to a 2-connected graph H and each component of a 
decomposition is 2-connected. The graphs in a decomposition are not disjoint, they form 
a single connected graph. If we delete from this graph the marker edges, we obtain H. 
To every decomposition corresponds a tree, the nodes of which are the components of the 
decomposition. Two nodes are adjacent if they share a marker edge. 

It is proved in ^H] that the family 25(G) for a 2-connected graph G is weakly partitive. 
If we decompose a 2-connected graph by using only good bipartitions at each step, we 
obtain at the end a canonical (unique up to isomorphism) decomposition (Theorem 1 of 
[T5]). This canonical decomposition is the one defined by Tutte and proved unique in |3*T] . 
chapter 11: every 2-connected graph has a unique decomposition in terms of bonds (i.e., 
graphs consisting of several parallel edges between two vertices), cycles and 3-connected 
graphs such that no two bonds and no two cycles share an edge. 

Proposition 13.41 is applicable and shows that the tree of the Tutte decomposition can be 
constructed by an MS transduction using an auxiliary order on the set of edges. However, 
another construction, not using any linear order is given in Theorem 4.7. It uses the 
detour through 2-dags, as in the proof of Lemma 12.201 We do not discuss any longer this 
construction. 

Question: In the case of a simple graph G, one might hope, by using Corollary 12. 121 to 
be able to construct the structure {Vq U Nt, edg G , edg T ,boxT) from {Vg, edg G ) by an MS 
transduction. However this is not immediate from the above results because the proof of 
this corollary is valid for directed graphs, and in order to orient the edges of a graph, edge 
set quantifications are necessary (see jHj). However, an alternative construction might be 
possible, giving a statement analogous to Corollarv l2.12l We leave this as an open question. 

4. The split decomposition 

In this section we apply the results of Section El to the split decomposition of graphs 
defined by Cunnigham in ^2] and used as a preliminary step in several algorithms: for the 
polynomial time recognition of circle graphs in [2j, for the recognition of parity graphs in 
[S] and for the construction of distance labellings in |21| . 

4.1. Splitting a graph. As in Subsection 12.21 graphs are simple, directed and loop-free. 
The split decomposition will be applied to connected graphs. Hence, most definitions are 
restricted to connected graphs, which permits to avoid some technical difficulties. A directed 
graph is strongly connected if for any two vertices u, v, there are directed paths from u to v 
and v to u. An undirected graph is connected if and only if it is strongly connected. 

Definition 4.1. Splitting a graph. A split of a connected graph G is a bipartition {A,B} 
of V G such that E G = E G[A ] U £ G[B] U (Ax x Bi) U (B 2 x A 2 ) for some Ai C A, B { C B, 
and each of A and B has at least 2 elements. If {A, B} is a split, then G can be expressed 
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as the union of G[A] and G[B] linked by one or two directed, complete bipartite graphs. 
(Since G is connected the set {A\ X B\) U (B2 X A2) is not empty). 

The inverse of splitting is the join operation, defined as follows. Let H and if be two 
disjoint graphs with distinguished vertices h in H and k in X. We define H^^,k) K as the 
graph with set of vertices Vh U Vr- — {/i, k} and edges x — > y such that, either a; — > y 
is an edge of if, or an edge of if, or we have x — > h in H and fc — ► y in if, or we have 
h — > y in H and x — > k in if. The subscript (/i, fc) in ^th,k) wm be omitted whenever 
possible. 

If {j4, 5} is a split, then G = H ^(h,k) K where H is augmented with a new 

vertex h and edges x — ► h whenever there are in G edges from x to some urn. B, and edges 
h — > x whenever there are edges from some u in S to x. The graph if is defined similarly 
from G[-B], with a new vertex k. These new vertices are called markers in j!7j . We say that 
h and k are neighbours if they are created from a same split. Note that the graphs H and 
if have at least 3 vertices and strictly less vertices than G. 

A technical variant (used in |17| ) consists in letting h = k. In this case the graphs 
H and if have in common the marker vertex h and nothing else. We write in this case 
G = H 

^(h,h) K- The advantage is that H U if is a single connected graph. However, the 
marker must be identified in some way. But when one iterates the decomposition process, 
it is easier to handle of the components of the decomposition as disjoint graphs. 

Definition 4.2. Decompositions. A decomposition of a connected graph G is defined induc- 
tively as follows: {G} is the only decomposition of size 1; if {G\ , . . . , G n } is a decomposition 
of size n and Gi = H ^fh,k) then {G\, . . . , H, if, Gj+i, . . . , G n } is a decomposition 
of G of size n + 1. The graphs Gi are called the components of the decomposition. The 
graph G can be reconstructed without ambiguity provided the marker vertices and their 
matchings are specified. We say that two components are neighbours if they have neigh- 
bour marker vertices. From the inductive definition of decompositions, it is clear that the 
components of a decomposition form an unrooted tree for the neighbourhood relation. 

It will be convenient to handle a decomposition T> = {G\, . . . , G n } of a graph G as a 
single graph Sdg(T>) called a split decomposition graph, an SD graph in short. The compo- 
nents of T> being pairwise disjoint, we let Sdg(T>) be their union together with particular 
edges, called e-edges between any two neighbour marker vertices. Every vertex of G is a 
vertex of Sdg(T>). The graph G can be reconstructed in a unique way from Sdg(T>). Two 
decompositions T> and T>' of a graph G are isomorphic if there exists an isomorphism of 
Sdg(T>) onto Sdg{V) which is the identity on Vq. 

The objective is to construct for every connected graph a canonical decomposition by it- 
erated good splittings. Figure 4 shows a graph G and Figure 5 shows the graph representing 
its canonical split decomposition. The dotted lines are the e-edges. 

In the perspective of obtaining a canonical decomposition, we first observe that the 
graphs H and if associated with a split {^4, B} of a graph G such that G = H M if , H 
contains A and if contains B are not always uniquely defined. Consider G: 1 — ► 2 — > 
3 — ► 4, A = {1,2}, B = {3,4}. One can take H = 1 — ► 2 — > h, if = k — >3 — ► 4, 
but one can also add an edge: h — ► 2 to H, or an edge from 3 — ► k to if (but not both 
simultaneously), and we still have G = H^^,k) However, H and if are uniquely defined 
in certain situations, as shows the following lemma. 



GRAPH DECOMPOSITIONS 



29 



a 




Figure 4: A graph G. 




Figure 5: The split decomposition graph S dg (Split (G)). 



Lemma 4.3. 

1) Let G be a strongly connected (resp. undirected and connected) graph and {A, B} be 
a split. There is a unique pair of graphs (H, K) such that H contains A, K contains 
B and G = H M K , where unique is meant up to isomorphism. 

2) Furthermore, the graphs H and K are strongly connected (resp. undirected and con- 
nected). They are isomorphic to induced subgraphs of G or to graphs obtained from 
induced subgraphs of G by fusing one vertex of indegree and one vertex of outdegree 
0. If G is undirected, only the former case occurs. 

Proof. The verifications are easy. For assertion 2), if G is undirected then H is isomorphic 
to G[A U {v }] where v is any vertex of B adjacent to some vertex of A. Otherwise, H is 
isomorphic to the graph obtained from G[A U {u, v}] by the fusion of u and v where u is 
any vertex of B such that x — ► u for some vertex x of A, and v is any vertex of B such 
that v — ► x for some vertex x of A. It may happen that u = v. □ 

Remark. These assertions are not true for nonconnected graphs: the undirected graph I4 
(where I n has n vertices and no edge) is equal to I3 M H, where H consist of one isolated 
vertex and an edge, one end of which is the marker vertex h. If G is strongly connected, 
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the graphs H and K need not be induced subgraphs: consider C 4 = C 3 IE] C 3 where C n 
denotes the directed circuit with n vertices. 

The decomposition process must terminate because the components are getting smaller 
and smaller, and are thus at the end "unsplittable" . A graph is prime if it has at least 4 
vertices and no split. The graphs with at most 3 vertices have no split for the trivial reason 
that they have not enough vertices. They are not called prime. We give easy verifiable 
examples of prime and splittable graphs. For further reference, we put them into a lemma. 

Lemma 4.4. 

1) A prime graph is 2- connected. 

2) There is no prime undirected graph with 4 vertices. 

3) For each n>5, the graph C n is prime, and the graphs P n , K n , S n -i, L n , all with n 
vertices, are not. □ 

As usual, we denote by K n the n-clique, i.e., the complete undirected graph with n 
vertices, by S n the n-star consisting of one vertex, the center, adjacent to n vertices by 
undirected edges, by P n the undirected path with n — 1 edges and n vertices, by L n the 
transitive (acyclic) tournament on n vertices (the directed graph of a strict linear order), 
by C n the undirected cycle with n vertices. The graphs K n ,S n -\ for n > 4 are "highly 
decomposable", or brittle in the terminology of j!81 117j : every bipartition, each block of 
which has at least 2 elements is a split. They are the only undirected graphs with this 
property. The highly decomposable directed graphs have a more complex structure that we 
will review later. 

The 2-connected undirected graphs having 4 vertices are K^, C4, and (i.e., minus 
one edge). None of them is prime. The directed, 2-connected graph with 4 vertices defined 
as the union of the paths a — > b — ► c — > d and a — ► d — ► c is prime, as one checks by 
trying the three possibilities to split it. 

Definition 4.5. Canonical decompositions. A decomposition of a connected undirected 
graph G is canonical if and only if: 

(1) each component is either prime or is isomorphic to K n or to S n —i for n at least 3, 

(2) no two clique components are neighbour, 

(3) two neighbour vertices in star components are both centers or both not centers. 
If G has one or two vertices, we define {G} as its canonical decomposition. 

Restrictions (2) and (3) can be justified as follows: if two clique components, isomorphic 
to K n and K m are neighbour they can be merged into a single one isomorphic to _fT n+m _2, 
by using the elimination of e-edges described below and K n+m _2 has several overlapping 
splits (n + m — 2 > 4). Similarly, if two star components, isomorphic to S n and S m are 
neighbours, and the center of one is linked by an e-edge to a non-center vertex of the other, 
they can be merged into a single star isomorphic to S n+m -i, n + m — 1 > 3, and S n+m -\ 
has several overlapping splits. It is thus necessary to assume (2) and (3) in order to obtain 
a unique decomposition theorem because stars and cliques have several overlapping, hence 
"incompatible" splits. Note that the connected undirected graphs with 3 vertices are K3 
and S2, hence are among the possible types of nonprime components. 

As in Section |31 a split is good if it does not overlap other splits. Starting from a graph 
G and the decomposition {G}, one can refine it by iteratively splitting its components with 
respect to good splits only. Since a graph breaks into two strictly smaller graphs, one reaches 
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a decomposition that cannot be refined by any split. Since one only applies good splits, 
one cannot generate neighbour components that are cliques, or that are stars with a center 
neighbour to a non-center vertex. It is thus canonical. 

The following theorem concerns connected graphs. By using the obvious decomposition 
of a graph into connected components, we get thus a canonical decomposition for every 
undirected graph. The isomorphism of decompositions is defined in Definition 4.2. 

Theorem 4.6. |171 Theorem 3] A connected undirected graph has a canonical decomposi- 
tion. It is unique up to isomorphism. It can be obtained by iterated splitting relative to good 
splits. □ 

For directed graphs, there exists a similar notion of canonical decomposition, for which 
one needs another notion of "highly decomposable" graph, called a circle of transitive tour- 
naments, a CTT in short. A CTT is a graph with n > 3 vertices vq, . . . , v n -i, such that its 
edges are described in terms of a sequence of integers = p\ < p2 < ■ • ■ < Pk < Pk+i = n as 
the pairs Vi — > Vj such that p m < i < j < p m +\ for some m, 1 < m < k. (We let v n = vo.) 
In the special case k = 1, the loop vo — > vq is excluded. The vertices v pi , . . . , v Pk are called 
the hinges. We write that this graph is a k-CTT to specify the number k of hinges. 

A CTT is strongly connected and is not undirected. Each of its splits has a block of 
the form {vi, . . . , Vj} for some i, j with 0<i<j<n — 1. 

Here are some examples: For k = n, one gets a directed circuit. For k = 2, n = 4, 
pi = 0,£>2 = 2,£>3 = 4 one gets the graph — > 1 — > 2 — > 3 — ► with additional edges 
— ► 2 and 2 — ► 0. For k = l,n = 3, one gets the graph — ► 1 — ► 2 — ► with 
additional edges 1 — > and — > 2. A 1-CTT with n vertices has all its vertices of degree 
n, except the hinge which has degree 2n — 2. (Since graphs are defined as directed, a vertex 
in a loop- free directed graph with n vertices has maximum degree 2n — 2). 

A decomposition of a strongly connected graph G is canonical if and only if: 

(1) each component is either prime, or is isomorphic to K n or to S n -\ for n at least 3, 
or is a CTT, 

(2) and (3) hold as for undirected graphs, 

(4) if two neighbour components are CTTs and each of them has at least two hinges, 
then the neighbour vertices are not two hinges. 

If G has one or two vertices, we define {G} as its canonical decomposition. 

To justify the roles of cliques and stars, we recall that an undirected edge is defined 
as a pair of opposite directed edges. If two neighbour components in a decomposition are 
CTTs with respectively k and m vertices and k' + 1 and m' + 1 hinges, and two hinges are 
linked by an e-edge e, then they can be merged (by what we will call in the next subsection 
the elimination of e) into a single (A/ + m')-CTT with k + m — 2 vertices. This is shown 
on Figures 6 and 7 : two 3-CTTs are merged into a single 4-CTT. In all other cases where 
two CTTs are neighbour, the elimination of the e-edge linking them does not yield a CTT, 
a star or a clique. 



Theorem 4.7. |17| Theorem 2] A strongly connected graph has a canonical decomposition. 
It is unique up to isomorphism. It can be obtained by iterated splitting relative to good 
splits. □ 
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Figure 6: Two 3-CTT's linked by an e-edge. 





Figure 7: Two CTT's merged into a 4-CTT. 



The split decomposition of a strongly connected graph G (which includes the case of a 
connected undirected graph) denoted by Split (G) is the canonical decomposition of Theo- 
rems EHl an dE3 I n the next subsections we define its representation by a graph, and its 
construction by an MS transduction. 

4.2. Graph representations of decompositions. We have defined a single graph Sdg(D) 
linking all components of a decomposition T>. We obtain in this way a binary relational 
structure on a fixed finite signature, actually an edge-labelled graph, from which the decom- 
posed graph can be reconstructed by an MS transduction, as we will see. We get something 
similar to the graph representation of modular decompositions defined in Section |2I 

Definition 4.8. Split decomposition graphs. A split decomposition graph, (an SD graph in 
short) is a graph H with two types of edges, defined as a triple (Vh, edg H ,e-edg H ) satisfying 
the following conditions 

(i) the solid edges are represented by a binary relation edg H , and the (undirected) 
e- edges are represented by a symmetric binary relation e-edg H ; 

(ii) every vertex is incident to a solid edge; 

(iii) no two e-edges are adjacent; 

(iv) the graph obtained from H by contracting the solid edges is an undirected tree. 

Condition (iv) implies that a cycle in H can only consist of solid edges, and that H 
is connected. The connected components of the subgraph H[Ef^ 1 ] where E^ 1 is the set of 
solid edges, are called the components of H. They are not isolated vertices and they are 
linked to one another by e-edges, in the global shape of a tree. 

Lemma 4.9. The graph Sdg(D) associated with a decomposition T> of a connected graph G 
is an SD graph. Its vertices incident to no e-edge are the vertices of G. □ 

We now explain how a graph can be reconstructed from the SD graph Sdg(D) repre- 
senting one of its decompositions. 

Definition 4.10. Evaluating SD graphs If e is an e-edge u — v of an SD graph H, we define 
an SD graph H' = Elim e (H) as follows: 
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(a) V H > =Vh- {u,v}, 

(b) the edges of H' are those of H not incident to u or v, and the edges x — ► y if 
x — ► u — v — > y or x — > v — u — ► y in H. (The edges x — > u, v — > y, x — ► v 
and u — > y are necessarily solid edges). 

We will say that this operation eliminates the edge e. 

Lemma 4.11. If H and K are two disjoint graphs and G is their union with an e-edge 
linking h in H and k in K , then Elim e (G) = H ^(h,k) K- If e an d f are two e-edges of an 
SD graph H , we have Elimf(Elim e (H)) = Elim e (Elimf(H)). 

Hence one can eliminate simultaneously (or in any order) the e-edges of a given set. We 
let Eval(H) be obtained by eliminating all e-edges of an SD-graph H. We use the notation 
Eval because we consider this mapping as the evaluation of a kind of algebraic expression, 
the operations of which are defined by the components of H. 

Lemma 4.12. For an SD graph H, the graph G = Eval(H) can be defined as follows 
(a') Vg is the set of vertices of H incident to no e-edge, 

(b') the edges of G are the solid edges of H not adjacent to any e-edge and the edges 
x — ► y such that there is in H a path 

x — ► m - vi — ► u 2 - v 2 — > ■ ■ ■ — ► u k - v k — ► y 

where the edges Ui — Vi are e-edges and alternate with solid edges. □ 

Example 4.13. The following graph H is an SD graph: 

a — ► b — > u — v — > c < — u — v < — d < — u" — v" — > e 

and Eval(H) is the non connected graph: a — ► b — > c < — d e. This example shows that 
not every SD graph is associated with a decomposition of a connected graph. 

Proposition 4.14. If V is a decomposition of a connected graph G, then Eval{Sdg(V)) 
= G. 

Proof. By induction on the size k of T>. If k = 1, then we have Sdg(T>) = G. For the induc- 
tion step, we let V = {G\, . . . , G^} be a decomposition with corresponding graph Sdg(T>') 
such that Eval(Sdg{V')) = G. We prove the assertion for V = {Gi, . . . ,G k -i,M, M'}, 
obtained by splitting one component, say G k without loss of generality. The graph Sdg(V) 
is obtained from Sdg(T>') by the replacement of the subgraph G^ by the union of M and 
M' linked by an e-edge, say e, and Elim e {Sdg(V)) = Sdg(V). We have Eval{Sdg(V)) = 
Eval(Elim e (Sdg{V))) = Eval(Sdg{V')). Since Eval(Sdg{V')) = G by the induction hypoth- 
esis, we obtain Eval(Sdg(V)) = G. □ 

The notion of clique-width of a directed or undirected graph G, denoted by cwd(G), 
and a few results about it, are recalled in Appendix 2. It is defined for graphs with labelled 
edges, hence is applicable to SD graphs. 

Proposition 4.15. The mapping Eval from SD graphs to graphs is an MS transduction. 
There exists a function f such that Eval(H) has clique-width < f(k) if each component of 
an SD graph H has clique-width < k. 

Proof. That the mapping Eval is an MS transduction is clear from its definition and the 
fact that the transitive closure of an MS definable binary relation is MS definable. 
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For the second assertion, we use the fact for every MS transduction r, there exists 
a function / such that cwd(G') < f(cwd(G)) whenever G' is obtained from G by r. (See 
Lemma A. 2. 2). Hence assuming that each connected component of H[E^ 1 ] has clique-width 
< k, it is enough to prove that cwd(H) < k + 2. 

We need a few technical facts about the algebraic expressions defining clique-width. 
We recall here that if C is a set of k labels, a C-expression defining a graph G witnesses 
that G has clique- width at most k (full definitions in Appendix 2). Let C and D be disjoint 
sets of labels. Let M be a graph with pairwise distinct vertices v±, . . . , v m . Let N±, . . . , N m 
be pairwise disjoint graphs such that iVj has in common with M the single vertex V{. We 
assume that M is defined by a C-expression, and that each Ni is defined by a (C U D)- 
expression, its vertices are labelled in D, and Vi has a label rj that is different from those 
of the other vertices of iVj . 

Claim 1: The graph L = M U Ni U • • • U iV m can be defined by a (C U D)-expression. 

Proof of the claim. Let E be an expression defining M. It has occurrences of constants 
pi, . . . ., p m which define respectively the vertices v±, . . . , v m . Let F±, . . . , F m be (C U D)- 
expressions defining respectively N±, . . . , N m . The expressions F- = ren ri ^ Pi (Fi) define the 
graphs Ni with Vi now labelled by pi. The desired (C U L>)-expression for L is obtained 
by substituting in E the expressions F[, ..,F' m for the occurrences of pi, . . . .,p m defining 
vi, . . . ,v m , giving an expression E' . Since E does not contain operations involving labels 
in D, the substitution of the expressions F[, ■■,F{ n in E does not result in edge creations 
between the vertices of the graphs Ni other than Vi and the vertices not in iVj. Hence, E' 
is a (C U D)-expression defining L. □ 

We now continue the proof of the proposition. Let H be an SD graph with components 
of clique-width < k. We wish to prove that cwd(H) < k + 2. The case where H has a single 
component is obvious. 

We let D = {T, _L}, and C be a set of k labels. For every e-edge e : v — u, we let H v ^ e 
be the subgraph of H consisting of v, e and the connected component of u in the graph H 
minus the edge e. (We recall that the e-edges link the components of H in the global shape 
of a tree). We label v by T and all other vertices of H Vfi by _L. 

Claim 2: Each graph H V:B labelled in this way is definable by a (C U D) -expression. 

Proof of the claim. The proof is by induction on the number of e-edges of H v ^ e . We let M 
be the component of H containing u, the other end of e. It is a subgraph of H v>e . By the 
hypothesis, M is defined by a C-expression. We let v±, . . . ,v n be the other vertices of M 
incident with e-edges, respectively e±, . . . , e n , which are the e-edges linking M at vertices 
vi, . . . ,v n to other components of H. Using induction, we obtain that each graph H Vuei is 
definable by a (C U Z))-expression. We let N be the edge e, with v labelled by T and u 
labelled by _L. Claim 1 is applicable to the graph L = M U H Vl>ei U • • • U H Vnt e n U N, which 
is equal to H v e . Hence H v ^ e is definable by a (C U Z))-expression. 

This argument applies for n = which is the basis of the induction. □ 

The graph H is itself is expressible as MUH vltei U- ■ -UH Vnten where M is any component, 
using the notation of Claim 2. Its proof yields the desired result since the cardinality of 
C U D is k + 2. This completes the proof of the proposition. □ 
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We leave as an open question to determine a good bounding function /. 

Proposition 4.16. A set of strongly connected graphs has bounded clique-width if and only 
if the prime components of their split decompositions have bounded clique-width. 

Proof. We first consider undirected graphs (for them strong connectedness is just connect- 
edness). The "only if direction is clear because the prime components of the split decom- 
position of an undirected graph are isomorphic to induced subgraphs of this graph, and 
clique-width is monotone with respect to induced subgraph inclusion. (See Lemma A.2.1). 

For the other direction, it suffices to apply Theorem 14.61 and Proposition 14. 151 knowing 
that the cliques K n and the stars S n have clique-width 2. 

We now consider directed graphs. We will use Theorem 14. 71 For the "only if direction, 
we note that, by Lemma l4,3l a prime component M of the split decomposition of a strongly 
connected graph G is either an induced subgraph of G or is obtained from an induced 
subgraph N by the fusion of a vertex of indegree and a vertex of indegree 1. In this case, 
cwd(N) < k implies cwd(M) < 4k by Lemma A. 2. 3. 

For the "if direction, we argue as above, and it remains to prove that CTTs have 
bounded clique-width. Actually they have clique-width at most 4. Let G be a fc-CTT 
with vertices vo,. . . , u n -l , n > 3, and edges V{ — ► Vj such that p m < i < j < p rn +i for 
some m, 1 < m < k, where = p\ < p2 < • • • < Pk < Pk+i = n and v n = vq. For every 
i = 0, . . . , n — 1, we let Gi be the subgraph of G defined as follows 

(a) its vertices are vo,...,Vi , 

(b) its edges are those of G of the form Vj — * Vf. for < j < k < i (hence G n _i is G 
minus the edges towards Vq); 

(c) its vertices are labelled as follows: we label vq by 1; letting m be such that p m < 
i < Pm+i, we label Vj by _L if < j < p m and we label Vj by 2 if p m < j < i. 

The graphs Gi are defined by the following expressions: 

- G x = 0(^1,2(10 2); 

- if 2 < i < p 2 , then Gi = ren^- > 2{add2fi{addi^(Gi^i © 3))); 

- if p m < i < p m+ \ and m > 2, then Gi = ren 3 ^ 2 (add2,3(Gi~i © 3)); 

- if i = p 2 > 1, then Gi = ren^2{ren 2 ^±{add2fi{addi^(Gi-.i © 3)))), 
and finally 

- if i = p m and m > 2, then Gi = ren^2{fen2^,A_{add2^{Gi-i © 3))). 

Then G = add2 t i(G n -i). This shows that G can be constructed with the 4 labels 1,2,3, _L 
hence has clique-width at most 4. If G has a single hinge, then n — 1 < p2 = n and labels 
1,2,3 suffice. Thus 1-CTTs have clique-width at most 3. □ 

Remark. The clique-width of a graph may be strictly larger than the maximum clique- 
width of the components of its split decomposition. For an example the clique-width of P4 
is 3, -P4 = P3 M P3 and the clique-width of -P3 is 2. By contrast, the clique-width of a graph 
is the maximum clique-width of its prime components for the modular decomposition (by 
Lemma A.2.1). 

Another complexity measure for undirected graphs called rank-width is defined by Oum 
and Seymour (see ^3 HI]). It is equivalent to clique- width in the sense that the same 
sets of undirected graphs have bounded clique-width and bounded rank-width (because 
rwd(G) < cwd(G) < 2 rwd[ - G ^ +l - 1 where rwd(G) denotes the rank-width of G). The 
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rank-width of a graph is the maximal rank-width of its prime components for the split 
decomposition. 

4.3. Monadic Second-Order definition of the split decomposition. The following 
theorem is actually the basis for Theorems 14.61 and 14.71 

Theorem 4.17. [171 Theorem 9]; The family of splits of a strongly connected graph is 
weakly partitive. The family of splits of a connected undirected graph is partitive. □ 

Remark. This result may not hold for a graph that is not strongly connected. Take for 
example: 1 < — 2 — > 3 — > 4 < — 5 — ► 6 with additional edge 6 — > 1. The two splits 
{{1,2, 3}, {4, 5, 6}} and {{2, 3, 4}, {5, 6, 1}} overlap but {{2, 3}, {4, 5, 6, 1}} is not a split. 
Hence, the family of splits of this graph is not weakly partitive. 

We denote by BS{G) the family of splits of a graph G, and by BS g {G) the family of 
good ones. The tree T SSg ^ G ) (defined in Section EJ) is the tree of the split decomposition 
Split (G). To simplify the notation, we will denote it by Tqs^, remembering that it is 
based on good splits. Proposition 13.41 yields the following: 

Proposition 4.18. There exists an MS transduction that associates with a strongly con- 
nected graph G and a linear ordering =^ of its set of vertices the structure 

(V G U N Tbs(G) , edg G , e^T es(G) 

such that T BS{G ) = {Nt bs{g) . ed 9T BS{G) ) ■ □ 

From the tree T S g^ G ) > we build an SD graph Hq S ^ G ) and we will prove that it represents 
Split (G), i.e. that Hbs(G) = Sdg (Split (G)). 

Definition 4.19. The SD graph Hq^^g) constructed from Tbs(g)- To avoid special cases, 
we assume that G has at least 3 vertices. The tree-partition (Tbs{G)i^BS(G)) ls defined by 
Lemma l3.2l from the family BS g (G) (the set of good splits, which do not overlap any other). 
We let N be the set of nodes of the unrooted tree T BS ^ G )- An edge e : x — y of T S s(G) 
corresponds to a bipartition {P x ,P y } G BS g (G). 

For each such edge, we create two new vertices (e, x) and (e, y): they will be the marker 
vertices of Definition 14.11 More precisely, the nodes of Tbsig) correspond to the components 
of the split decomposition, and the markers of the component at a node x will be the vertices 
(e, x) for all edges e of T incident with x. 

For a node x £ N with neighbours y 1 , . . . , yk we let P y , . . . , P Vk be the sets associated 
with the edges e% : x — y 1; . . . , : x — (we use the notation of Lemma l3~3"|) . They are 
pairwise disjoint. By this lemma, Vg l s(G)( x ) = Vg — {Py 1 U- • -UP^J (this set may be empty). 
We define a graph H (x) as follows: 

(i) V H ( X ) = Vbs(G){x) U {(e»,x) \ i = l,...,k}, 

(ii) its edges are of several types: 

- the edges u — ► v in G, for u, v £ Vjss(G){ x )i 

- the edges u — ► (ej,x) if u G Vgs(G)( x ) an d there is in G an edge u — > v for some 
v in P yi , 

- the edges u < — (ej,x) if u G Vbs(G)( x ) an d there is in G an edge u < — v for some 
v in P Vi , 
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- the edges (ei,x) — > (ej, x), i ^ j if there is in G an edge u — ► v for some u in P yi 
and some v in P y . . 

As we will prove, these graphs are the components of the split decomposition. In order 
to obtain an SD graph H BS ^, we take their union and we link them by undirected e-edges 
between (e, x) and (e, y) for every edge e : x — y of T BS ^ G y This completes the definition of 
Hgg(Q\. If G has no good split, then BS g {G) is empty, the tree T BS ^ has one node and 
no edge, and H B s(G) = G. 

Proposition 4.20. If a graph G is strongly connected with at least 3 vertices, we have 
h bs(G) = Sdg(Split(G)) and Eval(H BS{G) ) = G. 

Proof. The proof is by induction on the number of vertices of G. 

1) The case of graphs with 3 vertices is checked directly: each graph is a clique, a star 
or a CTT, hence is necessarily a component, BS{G) is empty, and Hbsog) = G. 

2) If G has no good split, then it follows from ^Jj; Theorems 10 and 11, that G is 
either S n , or K n , or a CTT, or is prime. In all cases we have H BS ^ = G. 

3) If none of these cases hold, then G has a good split {^4, B} and G can be written 
as H M K in a unique way fLemma l4.3|) with Vh 2 A, Vk B. We let h and k be 
their marker vertices (cf. Definition 14.1)1 . 

Claim 1: The tree Tbs(g) is the union of the trees Tbs(h) and Tqs(k) linked by an edge 
between x and y, where x is the node of T BS ^ such that h G Vqs(h)( x ) an d y is the node 
°f T BS(K) such that k G Vbs(k){v)- 

Proof of Claim 1. Property F3 of Theorem 8 of ^7], states that for a split {A, B}, if A' C A, 
then {A', B U A - A'} is a split of G if and only if {A', {h} U A - A'} is a split of It 
follows that if {A, B} is a good split, then, with H and if associated with it as above: 

BS g {G) = {{A, B}} U {{A', CUB}\ {A', C U {h}} € BS g {H)} 

U {{£', C U A} | {5', C U {k}} e BSg(K)} . 

This fact gives the bijection between T BS ^ and the union of the trees T^stH) an d Tqs(k) 
linked by an edge as in the statement. The edge x — y corresponds to {^4, B}. □ 

Claim 2: The graph Hbs(G) ls isomorphic to the union of the graphs H&srm and Hbs(k) 
linked by an e-edge between h and k. 

Proof of Claim 2. Let e be the e-edge linking h and k. Let Xh and x^ be the nodes of 
T BS(H) and T BS{K) such that h G V BS{H) {x h ),k G V BS ( K) (x k ). 

We denote by Hqs(h) + H B s(K) the union of the graphs Hbs(h) an d Hbs(k) together 
with e where h is replaced by (e,Xh) and k by (e, xjfc). 

Our goal is to prove that H&s(G) = Hbs(H) + BiBSlK) ■ By Claim 1 and the definitions, 
the vertices of the graph H B s(G) are those of Hbs(h) + Hbs(K)- It remains to prove that 
the edges are the same in both. 

This is clear for the e-edges as an immediate consequence of Claim 1. We now consider 
the various types of solid edges. 

a) A solid edge of the form u — ► v, u, v G Vbs(G)( x )-> where none of u and v is a vertex 
(f,y), is in H B s(G) if and only if it is in H BS ( H) + H BS ( K ) because V B s{G){ x ) = 
Vbs(h) ( x ) H Vq for x a node of T B s(H) an d similarly for K. 
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b) Consider a solid edge (ej,x) — ► (ej,x). Without loss of generality, we assume that 
x is a node of T BS r H \ . 

Consider such an edge in H^s(G)' there is in G an edge it — > v for some it in P yi 
and some u in P y . , where yi, . . . ,y n are the neighbours of x in TggrQ\ as in Definition 

Subcase 1: One of (ej,x) or (ej,x), say (ej,x), is (e, x^). 

Then we have it — > u in G with i; G Py. = B. Hence, we have an edge (e^, x) — ► h 
in P, hence the edge (ei,x) — > (ej,x) in H^s(H) + Hbs(k) since (e,^) = (ej,x) 
replaces /i. 

Subcase 2: None of (e*, x), (ej, x) is (e,Xh) or (e, X&), u and v are both in if, and 
they are not h (because it — > v is an edge of G). 

Then the edge (ej,x) — > (ej,x) is also in P,gS(.ff)> because if we denote by P y . the 
blocks like P yi relative to P, then we have either P y . = P yi or P y . = P yi — Vk U {h}, 
by the result recalled in the proof of Claim 1. 

Subcase 3: As in the previous subcase except that one of u, v, say u is in P, and 
the other is in K. 

Then the edge it — ► h is in P, and we also have the edge (ei,x) — > (e-/,x) in 
Hbs(h) because h G P^ , since P^ = P % — Vk U {/i}, with the notation of the 
previous subcase. 

Conversely, let us assume that (ej,x) — > (ej,x) in Hjggrjjy We have in P an edge 
it — > u for some it in P' and some u in P' . . 

Subcase 1: None of u,v is h, then we have also (ei,x) — > (ej,x) in Hqs(G)i using 
the observation on the blocks P yi ,P y . made above in Subcase 2. 

Subcase 2: If it = h, then we have w — > u in G for some id in ET. Hence (ei,x) 
— ► (e i5 x) is in H BS{G) . 

The arguments are of course the same with K in place of P. 

c) Consider a solid edge it — ► (ej,x) in Pbs(gq, u £ Vg5((j)(x). There is in G an edge 
u — > v for some v in P yi , where &\ : x — y%, . . . , e n : x — y n are the edges of T BS ^ 
incident to x, as in Definition 14.191 There are several subcases: 

Subcase 1: u G V#, (ej,x) = (e, x/j). 

Then u G V^, but we have u — ► /t in PgS(H)- Hence the edge u — > (e, x/j) is in 

Hbs(H) + Hss(K) ■ 

Subcase 2: u G (ej,x) 7^ (e,Xh)- 

Then (ej,x) is in Psstm. Either u G Vjy, and then the edge it — ► (ej,x) is also in 
Hbs(h) ot v E Vk, so the edge u — > h is in H and the edge u — ► (ej, x) is also in 
#BS(H) because h E P y .. 

The argument is similar if u G Vr- and for the edges u < — (ej,x). 
Consider conversely a solid edge u — ► (ej,x) in Pss(h), u G V(x), u ^ h. There 
is in P an edge it — > v, where 1; in P^. (a block relative to P, same notation as in 
case b). 

If v = h, we have it — > it; for some it; G Vjf, hence u — ► (e^, x) in Pe<s(G)- I£v=£h, 
we have it — > 1; in G, hence also u — > (e«, x) in Pg5(G)- 

Again the argument is similar for a solid edge it — ► (ej,x) in H BS ^ K ^ and for the 
edges it < — (ej,x). □ 
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We can now complete the Proof. We have G = H M K . By induction, we can assume 
that H BS ^ = Sdg(Split(H)) and H BS ( K ) = Sdg (Split (K)). Using the notation of Claim 
2, the SD graph Sdg(Split(G)) is, by its definition, equal to Sdg(Split(H)) + Sdg (Split (K)). 
Hence, by Claim 2 and these equalities following from induction, it is isomorphic to H^sfG)- 
This completes the proof. □ 

Theorem 4.21. There exists an MS transduction that associates with a linearly ordered 
strongly connected graph the SD graph representing its canonical split decomposition. 

Proof. By Proposition 14.181 we have an MS transduction associating with (G, =4) the struc- 
ture (Vq U Nt, edg G , edg T ,boxT) where T is the tree of the canonical decomposition, i.e, 
T = Tks( G ) . 

The next task is to specify the pairs (e, x) for the edges e of T and their nodes x as 
pairs (u, i) for u in Vg U Nt and integers i in a fixed finite set. By using the ordering of 
Vg one can select the leaf r of T which contains a smallest vertex of G. We make T into a 
directed tree with root r. This orientation is MS definable. For an edge e of T, directed, 
say : x — ► y, we can define (e,x) as the pair (y, 1) and (e, y) as the pair (y, 2). Since T 
is a directed tree, each edge is specified in a unique way by its target. Hence, the vertex y 
refers to a single edge e. 

Hence the set of vertices of -ffss(G) is defined as Vg x {1} U (Nt — {r}) x {1, 2}. The 
conditions defining the edges of the graph H^sfG) are straightforward to express in MS 
logic, provided for each edge of T one can determine the corresponding good split. This is 
possible using the relation boxT- D 

Hence we have proved that the split decomposition of a strongly connected graph is 
definable by an MS transduction from the graph and a linear order of its vertices. It 
follows from Proposition A. 1.1 (in Appendix 1) that a property of graphs expressed as 
an MS property of their prime components and/or of the underlying trees of their split 
decompositions is an order-invariant MS property. 



5. Conclusion 

In this article, we have applied Monadic Second-Order logic to the graph decomposi- 
tions which follow the pattern of modular decomposition and to those defined in the frame- 
work of Cunnigham and Edmonds JH]- We have established general definability results in 
Monadic Second-Order logic, and we have applied them to the canonical decompositions 
of 2-connected graphs. We have obtained as new results a logical expression of Whitney's 
2-isomorphism Theorem and the definability in Monadic Second-Order logic of the split 
decomposition of Cunnigham ^21- The article [Hj applies this result to circle graphs stud- 
ied in the framework of Monadic Second-order logic. This application is presented in the 
Introduction. 

Here are some open questions (a few others are presented also in the main text). 

Question 1: The split decomposition works well for undirected graphs and for strongly 
connected directed graphs, because these graphs have canonical decompositions. What 
about connected directed graphs ? The strongly connected components of a graph G form 
a directed acyclic graph D. Directed acyclic graphs have unique modular decompositions. 
However, it is not clear how to combine the modular decomposition of D and the split 
decompositions of the strongly connected components of G in order to obtain a notion of 
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canonical decomposition subsuming these cases. Although directed graphs have no canon- 
ical split decomposition, it may be useful to construct non canonical ones for algorithmic 
purposes or for investigations on the structure of graphs. 

Question 2: Our logical formalization of decompositions, based on families of sets 
and on families of bipartitions can be applied to hypergraphs (along the lines of [1]), to 
^-structures which are also hypergraphs (see [211 )> to matroids (the MS logic of matroids 
has been studied by Hlineny [25)- These applications should be developped. 

Question 3: Another topic for future research is the extension of split decomposition 
to countable graphs, generalizing what is done in |13| for modular decomposition. 
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Appendix 1: Monadic second-order logic 

We review Monadic Second-Order (MS) logic and transformations of structures ex- 
pressed in this language, called MS transductions. The reader is refered to the book chapter 
10 , or to the preliminary sections of the articles [3 El f° r more detailed expositions. 
However all necessary definitions are given in full in the present section. 

Relational structures and monadic second-order logic. Let R = {A, B,C, . . .} be 
a finite set of relation symbols each of them given with a nonnegative integer p(A) called 
its arity. We denote by ST1Z(R) the set of finite -R-structures S = {D$, (As)agr) where 
A s C D P S {A) if A G R is a relation symbol. If R consist of relation symbols of arity one or 
two, then we say that the structures in ST1Z(R) are binary. 

A simple graph G can be defined as an {e<i<?}-structure G = (Vg, edg G ) where Vq is 
the set of vertices of G and edg G C Vg X Vg is a binary relation representing the edges. For 
undirected graphs, the relation edg G is symmetric. If in addition we need vertex labels, we 
will represent them by unary relations. Binary structures can be seen as vertex- and edge- 
labelled graphs. If we have several binary relations say A,B,C, the corresponding graphs 
have edges of types A, B, C. 

We recall that Monadic Second-order logic (MS logic for short) is the extension of 
First- Order logic (FO logic for short) by variables denoting subsets of the domains of the 
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considered structures, and new atomic formulas of the form x G X expressing the mem- 
bership of x in a set X. (Uppercase letters will denote set variables, lowercase letters will 
denote first-order variables). 

We denote by FO(R, W) (resp. by MS(R,W)) the set of First-order (resp. Monadic 
Second-order) formulas written with the set R of relation symbols and having their free 
variables in a set W consisting of first-order as well as of set variables. Hence, we allow 
first-order formulas with free set variables and written with atomic formulas of the form 
x G X. In first-order formulas, only first-order variables can be quantified. 

As a typical and useful example of MS formula, we give a formula with free variables 
x and y expressing that (x, y) belongs to the reflexive and transitive closure of a binary 
relation A: 

\/X(x G X A Vtt, v[(u £ X A A{u, v)) => v G X] => y G X) . 

If the relation A is not given in the structure but defined by an MS formula, then one 
replaces A(u, v) by this formula with appropriate substitutions of variables. 

A monadic second-order (MS) property of the structures S of a class C C ST1Z(R) is a 
property V such that for S G C: 

V(S) holds if and only if S \= p , 

for some fixed formula <p in MS{R,0). Let < be a binary relation symbol not in R. A 
formula ip in MS(R U {<}, 0) is order-invariant on a class C, if for every S G C, for every 
two linear orders =^ and on the domain D$ 

(S, =4) \= ip if and only if (S, =^') 1= ip , 

where ^ and interpret <. We say that V is an order-invariant MS property of the 
structures of a class C C ST1Z(R) if and only if 

P(iS r ) holds if and only if (fi 1 , =<!) (= for some linear order on D$ , 

where </3 is a fixed order-invariant MS formula. Order-invariant MS properties are investi- 
gated in |H] . A difficulty with this definition is that the set of order-invariant MS formulas 
is undecidable. However, we will use formulas that are order-invariant by construction. 

Monadic Second-order transductions. We will also use FO and MS formulas to define 
certain graph transformations. As in Language Theory, a binary relation 7Z C A x B where 
A and B are sets of relational structures will be called a transduction: .4 — > B. 

An MS transduction is a transduction specified by MS formulas. It transforms a struc- 
ture S, given with an n-tuple of subsets of its domain called the parameters, into a structure 
T, the domain of which is a subset of D$ x {1, . . . , k}. Furthermore, each such transduction, 
has an associated backwards translation, a mapping that transforms effectively every MS 
formula^ relative to T, possibly with free variables, into one, say (p&, relative to S having 
free variables corresponding to those of ip (k times as many actually) together with those 
denoting the parameters. This new formula expresses in S the property of T defined by <p. 
We now give some details. More can be found in [71 HUj. 

We let R and Q be two finite sets of relation symbols. Let W be a finite set of set 
variables, called parameters. A {Q , Redefinition scheme is a tuple of formulas of the form 

A = (<p,ipi,--- ,^ k ,(dw)weQ*k) where k > and Q*k := {(qj) \ q G Q,Jg [fc]^} , 
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<p G MS(R,W),ipi G MS{R,W U{xi}) for i = 1, • • ■ , jfe, and 

6> w G MS(R, W U {xi, • • • ,x p((?) }) for W =(g,J)GQ*A;. 

These formulas are intended to define a structure T in STTZ(Q) from a structure 5 in 
ST1Z(R). Let 5 G ST1Z(R), let 7 be a I^-assignment in S. A Q-structure T with domain 
Dt C D5 x [fe] is defined in (S, 7) by A if 

« (S,7)l=¥>, 

(ii) L> T = {(d,z) I rfe Ds.i G [fe],(5, 7 ,d) |=^}, 

(iii) for each q in Q 

q T = {((di,«i),-- - ,(dt,*t)) G I (S,7,rfi," - ,<*t) N (9 J)} , 

where j = ■ ■ ■ ,it) and t = p(q). 

The notation S \= ip means that the logical formula ip holds true in the structure S. 
By (S, 7, di, ■■■ ,d t ) \= Qr q jy we mean (5, 7') |= 0/ jw where 7' is the assignment extending 
7, such that 7'(xj) = for alH = 1, • ■ ■ , t; a similar convention is used for (S, 7, d) \= ipi). 

Since T is associated in a unique way with 5,7 and A whenever it is defined, i.e., 
whenever (S, 7) |= if, we can use the functional notation def ^(S, 7) for T. The transduction 
defined by A is the binary relation 

Pa := {(S, T) I T = def^(S,^f) for some M^-assignment 7 in S 1 } . 

Hence £> A C STK{R)xSTK{Q). A transduction / C STU{R) x STTZ{Q) is an MS 
transduction if it is equal, up to isomorphism of structures, to £>a for some (Q, .Redefinition 
scheme A. 

An MS-transduction is defined as a binary relation. Hence it can be seen as a "non- 
deterministic" partial function associating with an Restructure one or more Q-structures. 
However, it is not really nondeterministic because the different outputs come from different 
choices of parameters. In the case where W = 0, we say that the transduction is parameter- 
less ; it defines a partial function. It may also happen that different choices of parameters 
yield isomorphic output structures. This is the case in the example of edge contraction 
detailed below. 

We will refer to the integer k by saying that A and £>a are k- copying; if k = 1 we 
will say that they are noncopying. A noncopying definition scheme can be written more 
simply: A = (<p,ip,(9 q ) qe Q). We will say that an MS transduction is domain extending, 
if the formula tpi of its definition scheme A is the Boolean constantTr-ue. In this case, if 
T = def A (S ', 7) , then Dt contains D$ x {!}, an isomorphic copy of Ds- This transduction 
defines the domain of T as an extension of that of S. If in the definition scheme A we only 
use FO formulas, then we will say that Pa is an FO transduction. 

Example. Edge contraction. We consider a graph G with two types of edges, the 
ordinary edges and the e-edges. It is represented by a structure (Vq, edg G ,e — edg G ) where 
the binary relation e — edg G represents the e-edges. We want to define from G the graph 
H obtained by the contraction of all e-edges. 

It is formally defined as (Vh, edg H ) where Vh = Vq/ ~, ~ is the equivalence relation 
such that x ~ y if and only if x and y are linked by an undirected path made of e-edges, 
and edg H ([u], [v]) holds if and only if x G [it], y G [v] for some (x,y) in edg G ([u] denotes 
the equivalence class of u). The MS formula £,(x,y) defined as 

\/X[(x G X A Vit, v{u G X A (e - edg(u, v) V e - edg(v, u)) => v G X}) =>- y G X] 
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expresses x ~ y. For defining Vh we must select a set containing one and only one vertex 
of each equivalence class. This can be done with a set variable Y that will be a parameter 
of the MS transduction, satisfying the formula <p(Y) defined as\/x3\y[y GFA £(x,y)]- 

Edge contraction can be defined by the transduction with noncopying definition scheme 
A = (ip, ip, 9 e dg) where^>(Y, x) is x G Y and O e d g {Y, x, y) is 3u, v[x € Y A y G Y A edg(u, v) A 
£(x,u) A£(y,v)]. 

Notice that the structures associated with all values of the parameter Y satisfying tp(Y) 
are isomorphic. They only differ regarding the concrete subsets Y of Vq used as sets of 
vertices of H. 

Lemma A. 1.1. Let r : STK(R) — > STK(Q) be an MS (or FO) transduction. Let 
< be a binary relation symbol not in R U Q. One can transform r into an MS (or FO) 
transduction r ' : ST1Z{R U {<}) — ► ST1Z{Q U {<}) such that, for every S in STH(R) 
and every linear order ■< on its domain, t'(S, ■<) = (t(S), ■<') where ■<' is a linear order on 
the domain of t(S). 

Proof. Let r be /c-copying. For w = (<, j) G {<}*k it is easy to define FO formulas 9 W 
belonging to MS(R U {<}, W U {x 1 ,x 2 }) such that, in t'(S, ±) 

(di,i) ■<' (d2,j) if and only if either d\ ~< d 2 or {d\ = d 2 and i < j) . 

It is clear that -<' is a linear order on the domain of t{S) if ■< is one on S 1 . □ 

The fundamental property of MS transductions. The following proposition says that 
if T = def A (S,~/), then the monadic second-order properties of T can be expressed as 
monadic second-order properties of (S, 7). The usefulness of definable transductions is 
based on this proposition. 

Let A = (</?, ipi, ■ ■ ■ , ipk, (O w )weQ*k) be a (Q, ^-definition scheme, written with a set of 
parameters W. Let V be a set of set variables disjoint from W. For every variable X in V, 
for every % = 1, • • ■ , we let be a new variable. We let V':= {X{/X G V, i = 1, • • ■ , &:}. 
Let £ be a structure in STTZ(R) with domain D. For every mapping r/ : V' — > V(D), we 
let 7j k : V — > V(D x [k]) be defined by r, k {X) = r?(Xi) x {1} U • • • U r?(X fc ) x {fc}. With 
this notation we can state 

Proposition A. 1.2. For every formula (3 in MS(Q,V) one can construct a formula @ft 
in MS(R, V' U W) such that, for every S in STTZ(R), for every assignment 7 : W — ► S, 
for every assignment rj : V' — ► S we have 

(S, rj U 7) |= /?* if and only if def A (S, 7) is defined, 

n k is a V -assignment in def a(S,j), 

and {def A (S,^y),rj k ) \= (3 . □ 

If the definition scheme and the formula (3 are FO the formula /?* is also FO. Note that, 
even if T = def A (S, 7) is well-defined, the mapping r] k is not necessarily a F-assignment in 
T, because rj k (X) may not be a subset of the domain of T which is a possibly proper subset 
of Ds x {1, . . . , k}. We call the backwards translation of (3 relative to the transduction 
def A . 

The composition of two transductions is defined as the composition of the corresponding 
binary relations. If they are both partial functions, then one obtains the composition of 
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these functions. The composition of two domain extending MS (or FO) transductions is 
domain extending. 

Proposition A. 1.3. 

(1) The composition of two MS (or FO) transductions is an MS (or an FO) transduction. 

(2) The inverse image of an MS-definable class of structures under an MS transduction is 
MS-definable. A similar statement holds with FO instead of MS. □ 

Appendix 2: Clique- width 

Clique-width is, like tree-width a graph complexity measure. It is defined and studied 
by Courcelle and Olariu in JH], and also in |lfl| I27j. Graphs are simple, directed or not, 
and loop-free. 

Let C be a set of k labels. A C-graph is a graph G given with a total mapping from 
its vertices to C, denoted by laba- Hence G is defined as a triple (Vg, edg G , lab G ). We call 
lab G (v) the label of a vertex v. The operations on C-graphs are the following ones 

(i) For each i E C, we define a constant i for denoting an isolated vertex labelled by i. 

(ii) For i, j EC with i ^ j, we define a unary function addij such that 

addij(V G , edg G , lab G ) = (V G , edg' G , lab G ) , 

where edg' G is edg G augmented with the set of pairs (u,v) such that lab G (u) = i 
and lab G (v) = j. 

In order to add undirected edges, we take: addij(addj j i(V G , edg G , lab G )). 

(iii) We let also ren^j be the unary function such that 

reni^j(V G , edg G , lab G ) = (V G , edg G , lab' G ) , 

where lab' G {v) = j if lab G (v) = i, and lab' G (v) = lab G (v), otherwise. This mapping 
renames into j every vertex label i. 

(iv) Finally, we use the binary operation © that makes the union of disjoint copies of its 
arguments. Hence G G ^ G and its size is twice that of G. 

A well-formed expression t over these symbols will be called a C -expression, or a It- 
expression if we are only concerned with the size k of C. Its value is a C-graph G = val(t). 
The set of vertices of val(t) is (or can be defined as) the set of occurrences of the constant 
symbols in t. However, we will also consider that an expression t designates any graph 
isomorphic to val(t). The context specifies whether we consider concrete graphs or graphs 
up to isomorphism. 

A graph is considered as a graph all vertices of which are labelled in the same way. 
The clique-width of a graph G, denoted by cwd(G) is the minimal k such that G = val(t) 
for some fe-expression t. A graph with at least one edge has clique- width at least 2. The 
graphs K n , S n -i have clique- width 2, for n > 3. 

If we need to define graphs with vertex labels from a set L, then we use constant 
symbols i a for i in C and a in L. The labels from L are not changed, and do not affect the 
other operations. The clique-width of a graph does not depend on the possible labelling 
of its vertices. By contrast, it depends strongly on edge directions. Cliques and transitive 
tournaments have clique- width 2 but tournaments have unbounded clique- width (>8 ). To 
build a graph with labelled edges we use the operation add a ^j to add edges labelled by a 
from the vertices labelled by i to those labelled by j. 
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Lemma A. 2.1. 15 (1) The clique-width of a graph is equal to the maximum clique-width 
of its induced subgraphs. 

(2) The clique-width ofG[H/u] is equal to the maximum of cwd(G) and cwd(H). 

(3) The clique-width of a graph is equal to Max{m, 2}, where m is the maximum clique- 
width of the prime graphs of its modular decomposition. □ 

Lemma A. 2. 2. jlUj For every MS transduction r from graphs to graphs there exists a 
fonction f such that T £ t(S) implies cwd(T) < f(cwd(S)). □ 

Lemma A. 2. 3. Let G be a graph let u be a vertex of indegree 0, and v be a vertex of 
outdegree 0. Let G' be obtained from G by fusing u and v. Then cwd(G') < 4cwd(G). 

Proof. Let k = cwd(G) and E be a {1, ... , /c}-expression for G, considered as a {l}-graph. 
For every x in Vg — {u, v}, we let its type be 1 if u — ► x and x — * v, be 2 if u — * x and 
x — ► v does not hold, be 3 if x — ► v and u — ► x does not hold, and otherwise. 

We let H be the graph G[Vg — {u, v}} where every vertex has label (1, i) (instead of 1) 
and i is its type. We let C = {1, . . . , k} x {0, 1, 2, 3}. From E, by deleting the constants 
which define u and v, and by modifying the graph operations so that every label a of a 
vertex is replaced by (a, i) where i is its type, one can construct a C-expression E' defining 
H. Let p be a label, e.g., (2,0), which does not label any vertex of H. The graph G' with 
all its vertices labelled by p is the value of 

r en ( i >0 )^ p o ren (li i)^ p o ren {l ^p ° ren {l ^ p o add p ^^ o add p ^ 2 ) o add^^p o add(i )3 ) )P 

at E' © p. Hence G' has clique- width at most Ak. □ 
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